Continuously varying offset mark and methods of determining overlay

ABSTRACT

The present invention relates to overlay marks and methods for determining overlay error. One aspect of the present invention relates to a continuously varying offset mark. The continuously varying offset mark is a single mark that includes over laid periodic structures, which have offsets that vary as a function of position. By way of example, the periodic structures may correspond to gratings with different values of a grating characteristic such as pitch. Another aspect of the present invention relates to methods for determining overlay error from the continuously varying offset mark. The method generally includes determining the center of symmetry of the continuously varying offset mark and comparing it to the geometric center of the mark. If there is zero overlay, the center of symmetry tends to coincide with the geometric center of the mark. If overlay is non zero (e.g., misalignment between two layers), the center of symmetry is displaced from the geometric center of the mark. The displacement in conjunction with the preset gain of the continuously varying offset mark is used to calculate the overlay error.

CROSS REFERENCE TO RELATED PATENT APPLICATION

This application claims priority to U.S. Provisional Patent Application No. 60/546,024 entitled “APPARATUS AND METHODS FOR DETERMINING OVERLAY AND USES OF SAME”, filed Feb. 18, 2004, which application is herein incorporated by reference.

This application is a continuation-in-part of U.S. patent application Ser. No. 10/729,838 entitled “APPARATUS AND METHODS FOR DETECTING OVERLAY ERRORS USING SCATTEROMETRY”, filed Dec. 5, 2003 now U.S. Pat. No. 7,317,531, and which claims priority to the following co-pending U.S. Provisional Patent Applications: (1) Application No. 60/431,314 entitled METHOD FOR DETERMINING OVERLAY ERROR BY COMPARISON BETWEEN SCATTEROMETRY SIGNALS FROM MULTIPLE OVERLAY MEASUREMENT TARGETS, by Walter D. Mieher et al., filed 5 Dec. 2002, (2) Application No. 60/440,970 entitled METHOD FOR DETERMINING OVERLAY ERROR BY COMPARISON BETWEEN SCATTEROMETRY SIGNALS FROM MULTIPLE OVERLAY MEASUREMENT TARGETS WITH SPECTROSCOPIC IMAGING OR SPECTROSCOPIC SCANNING, by Walter D. Mieher, filed 17 Jan. 2003, (3) Application No. 60/504,093 entitled APPARATUS AND METHODS FOR DETECTING OVERLAY ERRORS USING SCATTEROMETRY, by Walter D. Mieher, filed 19 Sep. 2003, (4) Application No. 60/449,496 entitled METHOD AND SYSTEM FOR DETERMINING OVERLAY ERRORS BASED ON SCATTEROMETRY SIGNALS ACQUIRED FROM MULTIPLE OVERLAY MEASUREMENT PATTERNS, by Walter D. Mieher, filed 22 Feb. 2003, and (5) Application No. 60/498,524, filed 27 Aug. 2003, entitled “METHOD AND APPARATUS COMBINING IMAGING AND SCATTEROMETRY FOR OVERLAY METROLOGY”, by Mike Adel. These applications are herein incorporated by reference in their entirety.

This application is related to U.S. patent application Ser. No. 10/785,396 entitled “APPARATUS AND METHODS FOR DETECTING OVERLAY ERRORS USING SCATTEROMETRY”, filed Feb. 22, 2004, which application is herein incorporated by reference.

FIELD OF THE INVENTION

The present invention relates to determination of overlay between structures formed in single or multiple layers. More particularly, it relates to determining overlay based on diffraction of radiation interacting with such structures.

BACKGROUND OF THE INVENTION

In various manufacturing and production environments, there is a need to control alignment between various layers of samples, or within particular layers of such samples. For example, in the semiconductor manufacturing industry, electronic devices may be produced by fabricating a series of layers on a substrate, some or all of the layers including various structures. The relative position of such structures both within particular layers and with respect to structures in other layers is relevant or even critical to the performance of completed electronic devices.

The relative position of structures within such a sample is sometimes called overlay. Various technology and processes for measuring overlay have been developed and employed with varying degrees of success. More recently, various efforts have been focused on utilizing radiation scatterometry as a basis for overlay metrology.

Certain existing approaches to determining overlay from scatterometry measurements concentrate on comparison of the measured spectra to calculated theoretical spectra based on model shape profiles, overlay, and film stack, and material optical properties (n,k dispersion curves), or comparison to a reference signal from a calibration wafer.

Existing approaches have several associated disadvantages. For example, a relatively large number of parameters must be included in the profile, overlay, and film modeling to accurately determine the overlay. For example, in some approaches using simple trapezoidal models for both the upper and lower layer profiles, the minimum number of pattern parameters that must be included is seven, including overlay. If film thicknesses variation is included in the model, the number of parameters increases correspondingly. A large number of parameters could require increased processing resources, may introduce corresponding errors, and may delay the results, thereby possibly decreasing throughput and increasing inefficiencies and costs. For example, comparison of a measured spectrum to calculated reference spectra takes longer with more parameters, whether a library-based approach is used or a regression approach is used.

Another disadvantage of certain existing approaches to determination of overlay based on scatterometry is the detailed knowledge of the film stack, film materials, and pattern element profiles that may be required to determine accurate theoretical spectra to compare to the measured spectra.

Yet another disadvantage of certain existing approaches to determination of overlay based on scatterometry is the accurate knowledge of the scatterometry optical system that may be required to determine accurate theoretical spectra to compare to the measured spectra.

Therefore, in light of the deficiencies of existing approaches to determination of overlay based on scatterometry, there is a need for improved systems and methods for determination of overlay based on scatterometry.

SUMMARY OF THE INVENTION

The invention relates, in one embodiment, to a method of extracting overlay information from an overlay target having over laid first and second periodic structures designed with a preset gain factor. The method includes obtaining a spatial array of optical data along the overlay target. The method also includes determining the center of symmetry of the overlay target based on the spatial array of optical data. The method further includes calculating a distance X between the center of symmetry and the geometric center of the overlay target. The method additionally includes determining the overlay error by dividing the distance X by the preset gain factor.

The invention relates, in another embodiment, to an overlay target. The overlay target includes a first periodic structure located on a first layer. The first periodic structure has a first value of an attribute of a periodic structure. The overlay target also includes a second periodic structure located on a second layer, and positioned over the first periodic structure. The second periodic structure has a second value of the attribute of a periodic structure. The first and second values have a substantially small difference.

The invention relates, in another embodiment, to an overlay target. The overlay target includes a first grating positioned in a first layer and having a first pitch. The overlay target also includes a second grating positioned in a second layer and having a second pitch. The first and second pitches have a substantially small difference.

The invention relates, in another embodiment, to an overlay target. The overlay target includes a first portion located on a first layer. The first portion includes a first grating having a first value of a grating characteristic. The overlay target also includes a second portion located on a second layer and centered above the first portion. The first and second portions have substantially the same dimensional profile. The second portion includes a second grating and third grating that are positioned side by side. The second grating has a second value of a grating characteristic, and the third grating has a third value of a grating characteristic. The second value is slightly larger than the first value, and the third value is slightly smaller than the first value.

The invention relates, in another embodiment, to a method for determining overlay error. The method includes providing an overlay target comprising over laid periodic structures. Each of the overlaid periodic structures has a different value for an attribute of the periodic structures. The different values for the attribute of the periodic structures having a substantially small difference. The method also includes obtaining an array of spectra across the overlay target. The method further includes finding the null point of the array of spectra. The null point being the point that produces a difference spectra that is substantially zero over the entire spectral range. The null point indicating the center of symmetry of the overlay target. The method additionally includes calculating a distance X between the center of symmetry and the geometric center of the overlay target. The method also includes calculating a ratio D between one of the values or an average of the values and the difference between the first and second values. Moreover, the method includes determining the overlay error by dividing the distance X by the ratio D.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates the relative distribution of designed overlay offsets Xa, Xb, Xc, and Xd for corresponding interlayer patterns (overlay targets) A, B, C, and D according to an embodiment of the present invention.

FIG. 2( a) is a side view illustration of a patterned top layer L2 being offset by an amount +F from a patterned bottom layer L1 in accordance with one embodiment of the present invention.

FIG. 2( b) is a side view illustration of a patterned top layer L2 being offset by an amount −F from a patterned bottom layer L1 in accordance with one embodiment of the present invention.

FIG. 2( c) is a side view illustration of a patterned top layer L2 being offset by an amount +F+f0 from a patterned bottom layer L1 in accordance with one embodiment of the present invention.

FIG. 2( d) is a side view illustration of a patterned top layer L2 being offset by an amount −F+f0 from a patterned bottom layer L1 in accordance with one embodiment of the present invention.

FIG. 2( e) is a side view illustration of a patterned top layer L2 being offset by an amount +F+f0+E from a patterned bottom layer L1 in accordance with one embodiment of the present invention.

FIG. 2( f) is a side view illustration of a patterned top layer L2 being offset by an amount −F+f0+E from a patterned bottom layer L1 in accordance with one embodiment of the present invention.

FIG. 3( a) is a flow diagram illustrating a procedure for determining overlay in accordance with one embodiment of the present invention.

FIG. 3( b) shows a graphical representation of an approach to determination of overlay according to an embodiment of the present invention.

FIG. 4 is a diagrammatic representation of a conventional microscopic imaging system.

FIG. 5( a) is diagrammatic representation of a microscopic imaging system having a numerical aperture (NA) optimized for scattering characteristics in accordance with a first embodiment of the present invention.

FIG. 5( b) is diagrammatic representation of a microscopic imaging system having a numerical aperture (NA) optimized for scattering characteristics in accordance with a second embodiment of the present invention.

FIG. 5( c) is diagrammatic representation of a microscopic imaging system having a numerical aperture (NA) optimized for scattering characteristics in accordance with a third embodiment of the present invention.

FIG. 5( d) is diagrammatic representation of a microscopic imaging system having a numerical aperture (NA) optimized for scattering characteristics in accordance with a fourth embodiment of the present invention.

FIG. 5( e) is a top view representation of an imaging spectrometer, multiple site field-of-view example in accordance with one embodiment of the present invention.

FIG. 6 is a diagrammatic representation of a system for selecting one or more wavelength ranges in accordance with one embodiment of the present invention.

FIG. 7 is a diagrammatic representation of a simultaneous, multiple angle of incidence ellipsometer.

FIG. 8 is a schematic view of a spectroscopic scatterometer system in accordance with one embodiment of the present invention.

FIG. 9( a) shows a plurality of targets placed substantially-collinearly along either an X-direction or a Y-direction in accordance with a first embodiment of the present invention.

FIG. 9( b) shows four targets disposed collinearly along the X-dimension, and four targets disposed collinearly along the Y-dimension in accordance with a second embodiment of the present invention.

FIG. 10 shows an example incidence line and field of view for use in a technique for scatterometric overlay determination using an incidence line in accordance with one embodiment of the present invention.

FIG. 11 a is a top view representation of a first combination imaging and scatterometry target embodiment.

FIG. 11 b is a top view representation of a second combination imaging and scatterometry target embodiment.

FIG. 11 c is a top view representation of a third combination imaging and scatterometry target embodiment.

FIG. 12 is a diagram of a combined mark, in accordance with one embodiment of the present invention.

FIGS. 13A-13D show variations of a combined metrology tool, in accordance with several embodiments of the present invention.

FIG. 14 is a flow diagram using a combined metrology tool, in accordance with one embodiment of the present invention.

FIG. 15 is a flow diagram illustrating a method of determining overlay error, in accordance with one embodiment of the present invention.

FIG. 16 is a diagram of a one dimensional overlay target, in accordance with one embodiment of the present invention.

FIGS. 17A and 17B are top view diagrams of a one-dimensional overlay target, in accordance with one embodiment of the present invention.

FIGS. 18A and 18B are side view diagram of a one dimensional overlay target, in accordance with one embodiment of the present invention.

FIGS. 19A and 19B are top view diagrams of a one-dimensional overlay target, in accordance with one embodiment of the present invention.

FIG. 20A and 20B are side view diagram of a one dimensional overlay target, in accordance with one embodiment of the present invention.

FIG. 21 is a diagram of a one dimensional overlay target, in accordance with another embodiment of the present invention.

FIGS. 22A and 22B are diagrams of a two-dimensional overlay target, in accordance with one embodiment of the present invention.

FIG. 23 is a diagram of an overlay region, in accordance with one embodiment of the present invention.

FIG. 24A and 24B are diagrams of an overlay region, in accordance with one embodiment of the present invention.

DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS

Reference will now be made in detail to a specific embodiment of the invention. An example of this embodiment is illustrated in the accompanying drawings. While the invention will be described in conjunction with this specific embodiment, it will be understood that it is not intended to limit the invention to one embodiment. On the contrary, it is intended to cover alternatives, modifications, and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. The present invention may be practiced without some or all of these specific details. In other instances, well known process operations have not been described in detail in order not to unnecessarily obscure the present invention.

An aspect of the present invention provides a set of four or more scatterometry overlay targets which have been formed on a sample, such as a semiconductor device. A pattern could also be described as a “pattern or interlayer pattern”, with the two terms being synonymous under most circumstances. In a particular implementation, the sample has two or more layers of a semiconductor device, and the targets are utilized to provide a measure of the placement accuracy of various structures comprised in the device. In general, placement accuracy is characterized by measurement of an overlay error between two different layers of the semiconductor device.

In a specific embodiment, a set of four targets are provided, and each target includes two sets of structures on two different layers which are offset from each other. In a specific implementation, an offset may be defined as the sum or the difference of two separate distances: a first distance F and a second distance f0, with F being greater than f0. Denoting the four targets as “target A”, “target B”, “target C” and “target D”, the corresponding predetermined offsets for each of these targets may be defined as follows for a specific target design: Xa=+F+f0 (for target A), Xb=−F+f0 (for target B), Xc=+F−f0 (for target C), and Xd=−F−f0 (for target D).

The offsets for Xa through Xd may be any suitable value for practicing the techniques of the present invention so as to determine overlay. For example, Xa and Xb may have different values of f0 than Xc and Xd.

FIG. 1 illustrates the distribution of offsets Xa, Xb, Xc and Xd along the x axis in a particular implementation of the invention. As shown, offsets Xa and Xc are both positive with Xa being larger than Xc. In contrast, offsets Xb and Xd are both negative with Xd being more negative than Xb.

The number of targets and the magnitude and sense of their corresponding offsets may be chosen in any suitable manner so that the techniques of the present invention may be practiced to determine overlay error. A specific set of targets and their corresponding offsets are described below in relation to FIGS. 2( a) through 2(f). It should be readily apparent that there are numerous combinations of targets and offset values which may be utilized to practice the techniques and utilize the systems of the present invention.

FIG. 2( a) is a side view illustration of a patterned top layer L2 being offset by an amount F from a patterned bottom layer L1 in accordance with one embodiment of the present invention. Each layer L1 and L2 is patterned into a set of structures. A structure may include any suitable feature, such as a line, trench or a contact. A structure may be designed to be similar to a semiconductor device feature. A structure may also be formed from a combination of different features. Further, a structure may be located on any layer of the sample, e.g., either above the top layer of the sample, within any layer of the sample, or partially or completely within a layer of the sample. In the illustrated embodiment of FIG. 2( a), layer L1 includes the complete structures 204 a-c, while layer L2 includes the complete structures 202 a-c. Construction of scatterometry overlay targets structures and methods for producing them are described in U.S. patent application, having application Ser. No. 09/833,084, filed 10 Apr. 2001, entitled “PERIODIC PATTERNS AND TECHNIQUE TO CONTROL MISALIGNMENT”, by Abdulhalim, et al., which application is herein incorporated by reference in its entirety.

As shown, the structures of the top layer L2 are offset by an amount F from the structures of the bottom layer L1. The structures of the two offset layers may be located within adjacent layers or have any suitable number and types of layers disposed in between the two offset layers. FIG. 2( a) also shows three films T1, T2, and T3 between patterned layers L1 and L2 and their corresponding structures. To the extent that any other layers exist between the two layers having the structures, these other layers exhibit at least a minimum degree of transmission for electromagnetic radiation to permit propagation of the radiation between the layers having the structures.

FIG. 2( b) is a side view illustration of a patterned top layer L2 being offset by an amount −F from a patterned bottom layer L1 in accordance with one embodiment of the present invention. FIG. 2( c) is a side view illustration of a patterned top layer L2 being offset by an amount +F+f0 from a patterned bottom layer L1 in accordance with one embodiment of the present invention. FIG. 2( d) is a side view illustration of a patterned top layer L2 being offset by an amount −F+f0 from a patterned bottom layer L1 in accordance with one embodiment of the present invention. FIG. 2( e) is a side view illustration of a patterned top layer L2 being offset by an amount +F+f0+E from a patterned bottom layer L1 in accordance with one embodiment of the present invention. FIG. 2( f) is a side view illustration of a patterned top layer L2 being offset by an amount −F+f0+E from a patterned bottom layer L1 in accordance with one embodiment of the present invention.

In general, an error offset E may be determined by analyzing at least the measured spectra from four or more targets A, B, C, and D each having offsets between two patterned layers, such as offsets Xa through Xd. This analysis is performed without comparing any of the spectra to a known or reference spectra. In other words, the error determination techniques of the present invention do not require a calibration operation.

FIG. 3( a) is a flow diagram illustrating a procedure 300 for determining overlay in accordance with one embodiment of the present invention. In this example, four targets A, B, C, and D are used which are designed to have offsets Xa through Xd as described above. That is, target A is designed with offset +F+f0; target B with offset −F+f0; target C with offset +F−f0; and target D with offset −F−f0.

Initially, an incident radiation beam is directed towards each of the four targets A, B, C, and D to measure four spectra S_(A), S_(B), S_(C), and S_(D) from the four targets in operations 302 a through 302 d. Operations 302 a through 302 d may be carried out sequentially or simultaneously depending on the measurement system's capabilities. The incident beam may be any suitable form of electromagnetic radiation, such as laser or broadband radiation. Examples of optical systems and methods for measuring scatterometry signals to determine overlay may be found in (1) U.S. patent application, having patent No. 09/849,622, filed 04 May 2001, entitled “METHOD AND SYSTEMS FOR LITHOGRAPHY PROCESS CONTROL”, by Lakkapragada, Suresh, et al. and (2) U.S. patent application, having application Ser. No. 09/833,084, filed 10 Apr. 2001, entitled “PERIODIC PATTERNS AND TECHNIQUE TO CONTROL MISALIGNMENT”, by Abdulhalim, et al. These applications are herein incorporated by reference in their entirety.

Further embodiments of suitable measurement systems and their use for determining overlay error are further described below. In various embodiments of the present invention, the spectra S_(A), S_(B), S_(C), and S_(D) (and any additional spectra that may be present) could include any type of spectroscopic ellipsometry or reflectometry signals, including: tan(Ψ), cos(Δ), Rs, Rp, R, □ (spectroscopic ellipsometry “alpha” signal), □ (spectroscopic ellipsometry “beta” signal),((Rs−Rp)/(Rs+Rp)), etc.

Spectrum S_(B)(−F+f0) is then subtracted from spectrum S_(A)(+F+f0), and spectrum S_(D)(−F−f0) is subtracted from spectrum S_(C)(+F−f0) to form two difference spectra D1 and D2 in operations 304 a and 304 b, respectively. Next, a difference spectrum property P1 is obtained from the difference spectra D1 and a difference spectrum property P2 is obtained from the difference spectrum D2 in operations 306 a and 306 b, respectively. The difference spectra properties P1 and P2 are generally obtained from any suitable characteristic of the obtained difference spectra D1 and D2. The difference spectra properties P1 and P2 may also each simply be a point on the each difference spectra D1 or D2 at a particular wavelength. By way of other examples, difference spectra properties P1 and P2 may be the result of an integration of averaging of the difference signal, equal the an average of the SE alpha signal, equal a weighted average which accounts for instrument sensitivity, noise or signal sensitivity to overlay.

After difference spectra properties P1 and P2 are obtained, the overlay error E may then be calculated directly from the difference spectra properties P1 and P2 in operation 308. In one embodiment, a linear approximation is performed based on the difference spectra properties P1 and P2 to determine the overlay error E, while in another technique the difference spectra properties P1 and P2 are used to approximate a sine wave function or other periodic function which is then used to determine the overlay error E. One linear regression technique is illustrated below with respect to FIG. 3( b). In one example, the overlay result may be obtained by a statistical calculation (e.g. averaging or weighted averaging) of overlay results obtained from properties of multiple wavelengths or multiple wavelength ranges.

In a variation of this implementation, if all four targets have the same characteristics, such as pitch P, thin film characteristics, structure size and composition, except for the offsets, and assuming that Xa and Xb are opposite in sign and have the same order of magnitude (0.1<Xa/Xb<10), and if 0.05<|Xa/P|<0.45 and 0.05<|Xb/P|<0.45, and if Xa is the same sign as Xc and Xb is the same sign as Xd, an estimate of the overlay error E present between structures within the interlayer targets can be calculated as follows using a linear approximation based on the difference spectra properties P1 and P2: E′=((S _(C) −S _(D))*(Xa+Xb)/2+(S _(A) −S _(B))*(Xc+Xd)/2)/((S _(A) −S _(B))−(S _(C) −S _(D))) or E′=(P2*(Xa+Xb)/2+P1(Xc+Xd)/2)/(P1−P2) where the difference spectra properties P1 and P2 are generally opposite in sign for overlay errors E<f0. If (Xa−Xb)=(Xc−Xd) and E=0 then P1=−1*P2.

Alternatively, if the same values for F and f0 are used for designing each target offset Xa, Xb, Xc, and Xd, then E′=(f0*P2+f0*P1)/(P1−P2).

The targets may be used to determine overlay of structures located at least partially in more than one layer, but could also be used to determine overlay of structures located substantially in a single layer.

FIG. 3( b) shows a graphical representation of the linear approach for determining the overlay error E in accordance with one embodiment of the present invention. As shown, the positive portion of the y axis shows a change in the difference spectra property P1 as a function of f0+E and the negative portion of the y axis shows a change in the difference spectra as a function of −f0+E. As described above, the difference spectra properties P1 and P2 are obtained from the difference spectra D1 and D2.

The overlay error E may be obtained by analyzing the two points (+f0+E, P1) and (−f0+E, P2). The overlay error E may be determined in one approach by performing a linear approximation with the two obtained difference spectra properties P1 and P2. Note that there are two points on the graph where E is zero, while other portions of the graph are a function of the overlay error E and f0. If the offsets are chosen carefully so as to be in the linear region, then the slope of the positive portion of the graph (P1/(+f0+E)) should equal the slope of the negative portion of the graph (P2/(−f0+E). Thus, the overlay error is given by E=f0*(P1+P2)/(P1−P2).

According to an implementation of the invention, if there are two targets with offsets +F and −F that are equal in magnitude but opposite in sign and no other overlay errors, then the 0th diffraction order scatterometry SE or reflectometry spectra are substantially identical from these two targets (to a good approximation) and the difference signal between the spectra corresponding to +F and −F is zero. Of course, any property of the difference signal is also zero. If one deliberately breaks the symmetry (artificially induces an overlay error) by designing an additional offset +f0, then the difference signal D1 is no longer zero and any suitable difference spectra property follows the same relationship as for an overlay error E. Similarly one can design another set of overlay targets with an additional offset −f0. Thus, the overlay error may be determined using a property of the difference signals D1(+F+f0, −F+f0) and D2(+F−f0), −F−f0), and accordingly no separate calibration step is required.

It should be understood that when the overlay error E is calculated from the spectra signals, it may be an estimate of actual overlay error. The calculated overlay error E may be denoted as the overlay error (E), or an estimate of the overlay error (E′).

If the pitch between structures is relatively large then the above described linear approximation techniques generally work well. However, when the pitch is relatively small then additional targets may be produced on the sample to improve the accuracy of the overlay measurements. The number of targets and corresponding scatterometry techniques which are used depend on the particular materials of the target and the scatterometry signal type implemented, among other factors. Whether to use four or more targets can be determined experimentally or by well known modeling methods. In one embodiment, two additional interlayer targets (denoted targets “H” and “J”) are produced on the sample, with corresponding offsets Xh and Xj. Upon being illuminated by incident radiation, the targets H and J produce corresponding diffracted components, which can serve as a basis for determination of an additional difference signal D3 and difference spectra property P3. This property P3 may be analyzed in connection with the difference spectra properties P1 and P2 to refine the determination of the overlay E to include non-linear corrections or measurements of the errors introduced by using a linear approximation.

One alternative embodiment to the linear approximation methods discussed above is to treat the scatterometry overlay signal as a periodic function and use phase detection methods to determine the overlay error. This embodiment may be preferred in some circumstances, depending on variables that may include scatterometry overlay target pitch, scatterometry overlay target design, scatterometry overlay (SCOL) target materials, the measured scatterometry signal, and the like.

The overlay error may be extracted from measuring multiple SCOL targets with pre-programmed additional built-in overlay offsets. (One example of the preprogrammed offsets could be Xa, Xb, Xc, and Xd as discussed above and in FIG. 1). The number of targets measured may be two, three, four, or more than four, or may vary between different overlay measurement locations. A scatterometry signal (as a function of the wavelength or incident angle, for example) is acquired from the required SCOL targets. For every arbitrary overlay, this signal is a periodic and even function of overlay error. A phase detection (or phase retrieval, phase extraction, or phase determination) algorithm utilizes these properties of the signals. The measured signal is represented by a set of even periodic functions with a corresponding number of free parameters (one of these free parameters is the overlay error itself). Different sets of such functions may be used, depending, for example, on the number of targets measured, scatterometry signal properties, target properties, and information required. The number of targets measured is to be greater or equal to the cumulative number of free unknown parameters. When several (two or more) scatterometry overlay (SCOL) targets (with different pre-programmed offsets) are placed in the immediate vicinity of each other (within 250 microns, for example), the overlay error may be assumed to be the same for all these targets. Each of the other free parameters can either vary or not vary from one SCOL target location to the other one (within the field and/or across the wafer). (Overlay is assumed to vary between different overlay measurement locations). Alternatively, these free parameters (or some of them) may either vary or not vary between X- and Y-SCOL target orientations. Based on the information required, the measurement accuracy required and on whether some free parameters are not varying location-to-location and/or between X- and Y-orientations, the total number of SCOL targets per overlay measurement location and total number of SCOL targets to be measured per field and/or per wafer is determined.

An example of a phase algorithm approach to determining overlay error from scatterometry signals from multiple targets is to treat the dependence of the scatterometry signals on overlay error as a periodic function. In this case the programmed offsets of the multiple targets are treated as initial phase offsets and the overlay error is treated as an additional phase. The overlay error can then be determined using well-known phase determination or phase retrieval methods. Well known phase retrieval methods that may include quadrature, 3-bucket, and 4-bucket phase retrieval algorithms can be used to determine overlay error. These phase retrieval methods are listed as examples only and are not meant to limit the scope of the invention. Phase detection methods are well known and are commonly used in diverse areas such as communications, interferometry, nuclear magnetic resonance, electronic circuits, to list a few examples. In another embodiment, a combination of linear, non-linear, and phase retrieval algorithms may be employed to determine the overlay error.

Certain conditions are preferably met with implementation of the above described techniques. The measurement areas are substantially identical in all aspects except for the offsets, e.g., +F+f0, −F+f0, +F−f0, and −F−f0. This is likely accomplished by placing the targets within about 100 microns or less of each other and by choosing targets which are relatively-robust to the process (i.e. they have similar or less sensitivity to process variation as the device features). In practice, on production wafers, the profiles may deviate from identical for different offsets if the topography from the lower pattern layer(s) and the upper layer changes in response to interacting with this topography. A difference or error signal between the two targets with different offsets is relatively independent of profile variation of the overlay target segments and to film variation as long as the profiles are common to the different targets. This is the substantial equivalent of common mode rejection of the parts of the signal that are determined by the profile and the films and the optics. The technique is also preferably robust to the range of process variation encountered in a typical manufacturing process. The signal differences due to the overlay error are also preferably larger than the signal differences due to other sources of process variation between the nearby scatterometry overlay targets (including mask errors).

If in a particular implementation the targets include structures grouped to exhibit the characteristics of lines, then a separate set of targets may be required for X and Y overlay measurements. If the overlay targets are composed of 2-dimensional structures (as seen from a top down view), then it may be possible to use one set of targets to get both X and Y overlay information. For oblique scatterometry, according to a specific implementation, it may be advantageous to rotate the orientation of the wafer with respect to the optical scattering plane to measure the different X and Y overlay errors. For true normal incidence, it may be possible to get both X and Y overlay information from the different polarizations without rotating the wafer or the optics.

Cartesian coordinates provide a convenient frame of reference for measuring overlay within a sample, with the x-y plane being located within, or substantially parallel with, a layer of the sample, and with the z axis being substantially perpendicular to the layers of the sample. The Cartesian coordinate system could be fixed with respect to the sample or could be rotated to reduce the complexity of the measurements. For example, overlay occurring diagonally across the sample but within a single layer could be described as two-dimensional x-y overlay in a Cartesian system with the x-y axes substantially parallel with the sides of a rectangular sample. That same diagonal overlay could be measured along a single axis, however, by rotating the x-y axes such that the x axis is parallel with the direction of the diagonal overlay. Three dimensional overlay could be reduced to two-dimensional overlay by restricting measurements within an x-y plane substantially parallel with a layer and ignoring any interlayer overlay occurring in the z direction.

In one embodiment, targets include more than one predefined offset, possibly between different sets of structures located in two layers, or possibly between different sets of structures located in more than two layers. In a general case, a target may include an indefinite number of layers, with all or some of these layers having structures producing predefined offsets. In a particular implementation, the structures in one or more underlying patterned layers of a target cause changes in the shape or topography of one or more upper layers (disposed above the underlying patterned layer(s)). In this implementation, the one or more upper layers may be substantially or partially opaque or absorbing, and at least part of the diffraction signal may arise from the topography of an upper layer, the topography arising at least in part from the underlying patterned layer.

According to one embodiment, structures included in a target may be organized in various configurations and shapes, including, for example, lines, grids, rectangles, squares, curved lines, curved shapes, or combinations of the foregoing. Such configurations of structures may be disposed at various locations within the target, and may describe various angles with respect to the electromagnetic radiation incident on the target. For example, the sets of structures could be organized as a set of parallel lines perpendicular to the direction of propagation of a collimated set of radiation rays or of a beam incident on the target. In another case, the structures organized as a set of parallel lines could be disposed at an acute angle with respect to the incident radiation, possibly at an angle of 45 degrees. Such a configuration could be advantageous by facilitating determination of overlay in both x and y directions, thereby reducing the need for additional overlay patterns or measurements.

Scatterometry System Embodiments and Use of Same

Several of the techniques of the present invention may be implemented using any suitable combination of software and/or hardware system. For example, the techniques may be implemented within an overlay metrology tool. Preferably, such metrology tool is integrated with a computer system which implements many of the operations of this invention. Such composite system preferably includes at least a scatterometry module for obtaining scatterometry signals of the overlay targets, and a processor configured to analyze the obtained scatterometry signals to thereby determine overlay error within such targets. At a minimum, the scatterometry module will usually include (i) a source of illumination oriented to direct radiation onto a specified location of the sample; and (ii) one or more detectors oriented to detect a scatterometry signal which has been scattered by the sample.

At least a portion of the techniques of the present invention may also be implemented in an overlay metrology system as an additional overlay measurement capability which complements an overlay measurement system or sub-system based on image analysis such as one used for conventional box-in-box or frame-in-frame overlay targets or other imaging type overlay measurement structures. Examples of apparatus which combine imaging-based overlay metrology and scatterometry-based overlay are described in the above referenced Provisional Application No. 60/498,524, which is incorporated here by reference. Overlay data from imaging overlay measurements and scatterometry overlay measurements may be combined for various uses including: calculating the overlay correctables, calculating other overlay corrections, calculating overlay errors at other locations on the wafer. More use cases for combinations of imaging overlay metrology and scatterometry overlay metrology are also described in above referenced Provisional Application No. 60/498,524.

Regardless of the system's configuration, it may employ one or more memories or memory modules configured to store data, program instructions for the general-purpose inspection operations and/or the inventive techniques described herein. The program instructions may control the operation of an operating system and/or one or more applications. The memory or memories may also be configured to store scatterometry data obtained from the targets and overlay error results and optionally other overlay measurement data.

Because such information and program instructions may be employed to implement the systems/methods described herein, embodiments of the present invention relates to machine readable media that include program instructions, state information, etc. for performing various operations described herein. Examples of machine-readable media include, but are not limited to, magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD-ROM disks; magneto-optical media such as floptical disks; and hardware devices that are specially configured to store and perform program instructions, such as read-only memory devices (ROM) and random access memory (RAM). The invention may also be embodied in a carrier wave traveling over an appropriate medium such as airwaves, optical lines, electric lines, etc. Examples of program instructions include both machine code, such as produced by a compiler, and files containing higher level code that may be executed by the computer using an interpreter.

Several of the system embodiments described below are mainly described and illustrated with respect to a scatterometry module or components for obtaining spectra from four or more targets, while the processor and memory are not shown.

Imaging Metrology Systems in Which the Numerical Aperture is Optimized for Measurement of Scattering Structures

FIG. 4 is a diagrammatic representation of a microscopic imaging system. As shown, the imaging system 400 includes a beam generator 402 for producing an incident beam 403 of electromagnetic radiation, a beam splitter 404 for directing the incident beam 405 towards the sample 408. Typically, the incident beam is focused onto the sample by an objective lens 406. An output beam 409 is then emitted from the sample in response to the incident beam and passed through the beam splitter 404 through relay lens 410 onto imager or camera 412. The camera 412 generates an image of the sample based on the output beam 409.

The system 400 also includes a processor and one or more memory 414 which are configured to control the various components, such as the beam generator 402, objective lens 406, and cameral 412. The processor and memory are also configured to analyze the detected output beam or image implementing the various scatterometry techniques described above.

Traditionally, such imaging systems (such as those used for overlay) have selected numerical apertures (NA's) (e.g., via objective lens 406) to optimize image resolution and to minimize optical aberrations. Selection of NA is typically performed in order to derive the overlay information from the variation in intensity over a single target (such as a box-in-box target) from the geometrical properties of the image.

Conventional imaging systems have relied upon high numerical apertures (NA's), such as 0.7 to 0.9, but doing so results in an expensive optical system which is sensitive to vibration, depth of focus, and optical aberrations. These issues reduce the achievable precision and cause a measurement error referred to as “tool induced shift” or TIS.

Scatterometry systems may take measurements at multiple sites sequentially in order to measure both x and y overlay and to eliminate effects due to variations in other sample parameters, such as film thickness. This type of measurement process results in significantly slower operation of the scatterometry tool relative to conventional overlay techniques.

In one embodiment of the present invention, the illumination and imaging NA's of an imaging optical system are chosen to optimize the performance of the instrument on scattering structures by ensuring that only the zero'th diffraction order is collected. One may take advantage of the fact that there exist performance advantages for certain metrology or inspection tasks pertaining to periodic structures when only the zero order diffraction is collected by the detection system. Under this condition, only the specular reflection is collected. Since the output which is scattered out of the specular is not collected and the nonspecular output may be more sensitive to aberrations, collection of only the specular output tends to minimize effects caused by optics aberrations. This condition also will result in a tool which will be optimized for relative photometric measurements of multiple sites in the field of view as described further below. Very low TIS may also be achieved, as compared to conventional imaging systems. Much higher throughput may also be achieved than with conventional scatterometry systems.

Choosing the illumination and imaging NA's for a particular imaging system is based on the particular configuration of such system. If we now consider the simplest imaging system in which the numerical aperture NA of illumination and collection are the same and the incident beam in normal to the sample surface, then the condition of “zero order diffraction only” can be met if: nλ>2dNA, where n=1. where d is the pitch of the structures of the targets being imaged. This can be restated in terms of the numerical apertures of illumination NA_(i) and collection NA_(c) of the imaging system as: nλ=d(NA _(i) +NA _(c))

This equation indicates that if we are able to constrain the numerical aperture of the illumination system, we can relax the constraint on the numerical aperture of the collection optics, which may be advantageous under certain conditions. Thus, the spectral range may be restricted to wavelengths greater than twice the product of the pitch and the NA. Under realistic conditions the scattered radiation beam will be wider (more divergent) than the illumination beam. Under realistic circumstances however, infinitely periodic gratings are not imaged and so the above equations become approximations and the diffracted plane waves become somewhat divergent. So it may be preferable to include a margin of safety in the constraint and require that: nλ≧2dNA(1+ε), where n=1 and □ is small, typically less than 0.5.

As an example, for an NA 0.4 imaging system, wavelengths may be restricted to values greater than 0.8 times the largest pitch, which does not seem to be an unreasonable constraint. For periodic structures having features of design rule 70 nm and below, the densest structures with pitches as low as 200 nm does not constrain the spectral range of imaging systems with operating wavelengths equal to about 200 nm or longer, while more isolated features with pitches as large as 500 nm are preferably measured with wavelengths longer than 400 nm.

It is preferable to account for these constraints when designing an imaging spectrometer for metrology and inspection applications. A limit on the spatial resolution of the imaging system is the numerical aperture of the system. It is advantageous to achieve the highest spatial resolution so as to be able to shrink to a minimum the size of metrology structures and conserve valuable wafer real estate. Restated, this allows minimization of proximity effects or “crosstalk” between adjacent features in the field of view of the imaging spectrometer. Therefore, the highest possible NA is achieved while meeting the constraint that the zero order diffraction is collected by the detection system.

Another interesting outcome of this constraint is that the highest possible overlay spatial resolution may be achieved without ever resolving the features under test. This may have further advantages as it should ensure that problematic aliasing phenomena are avoided in the imaging system. In a preferred embodiment, an architecture is provided in which the spectral band pass can be easily modified or selected by the measurement system or algorithm based on the largest pitch in the feature under test (e.g., as the systems in FIGS. 5A through 5D described further below). Alternatively, the NA of either illumination or collection could be easily modified, depending on the largest pitch in the feature under test. Alternatively, all of these embodiments may be implemented within a single system.

FIGS. 5A through 5E illustrate four embodiments of microscopic imaging systems having a numerical aperture (NA) optimized for scattering characteristics. As shown in FIG. 5A, the system 500 may have components which operate like the same named components of the system in FIG. 4. The system 500 further includes a wavelength selection device 520 for selecting a particular wavelength. The wavelength selection device 520 allows modification of the spectral band by selectively transmitting or selectively reflecting part or portions of the illuminating radiation. A wide variety of well known spectroscopic filtering techniques may be employed to modify the spectra band, including selecting from a set of band pass interference filters, continuously varying bandpass interference filters, grating based spectrometers, Fourier transform interferometers, acousto-optic tunable filters, to name a few. The wavelength selection device 520 is positioned within the incident beam path between the beam. The system 500 may also include a polarizer control device 522 for causing the incident beam to be in a particular polarization state and a polarization analyzer 524 for analyzing or separating out the polar components of the collected beam.

The system 530 of FIG. 5B is similar to the system 500 of FIG. 5A, except a wavelength modulation device 532 is used in place of a wavelength selection device. The system 540 of FIG. 5C is similar to the system 500 of FIG. 5A, except the wavelength selection device 542 is positioned in the output beam path. The system 550 of FIG. 5D is similar to the system 500 of FIG. 5C, except a wavelength modulation device 532 is used in place of a wavelength selection device. The wavelength modulation device 532 can be controlled by changing one or more optical path lengths in the wavelength modulation device 532 itself (e.g., an interferometric system, such as in a Michelson, Fabry-Perot, or Sagnac interferometers). The spectral information may be derived from the resulting signal with a transform analysis like a Fourier transform or Hadamard transform, for example.

FIG. 5E is a top view representation of an imaging spectrometer, multiple site field-of-view example in accordance with one embodiment of the present invention. In one implementation, spectra from pixels in each dotted box are averaged to create a spectrum for each of the four measurement targets. Alternatively, spectra from pixels located only in a central region of each dotted box are averaged together. Size and spacing of lines in the illustrated targets are exaggerated for emphasis.

The NA may be selected to ensure that only the zero'th diffraction order is collected in any suitable manner. In one proposed operational embodiment:

Two or more sites of differing characteristics are located in the field of view of the imaging system.

Images are captured over one or more spectral ranges.

For each measurement site in the field of view, all or some of the pixels determined to be within that site are summed or otherwise combined to characterize the photometric properties of that site in that spectral range.

Step 3 is repeated for each spectral range.

The results for each site over each spectral range are processed to determine the properties of the sample. For example, the above described spectral analysis techniques (i.e., F+f0) are used on the obtained spectra for each target.

Steps 1 though 5 are repeated for the plurality of measurement sites desired across the wafer.

While this example technique describes sequentially capturing images over different spectral regions, this could be accomplished simultaneously using a system of wavelength dependent beam splitters, filters, and/or mirrors. Alternatively, the same could be affected by using a device such as a Sagnac interferometer which captures multiple images at different optical path differences, these being used to derive information equivalent to images taken over different spectral ranges.

Scatterometric Overlay Using Filters

Conventional imaging overlay tools have a high magnification and small field of view. Inspection for gross patterning defects is either done manually on a microscope or automatically on a separate macro inspection tool. A low magnification overlay tool unfortunately requires multiple steps or tools, some of which are manual.

In one embodiment, a low magnification microscope with a mechanism for selecting one or more wavelength ranges is provided. This tool also preferably uses one or more broadband sources with filters, with multiple sources covering different wavelength ranges, variable filters, etc. FIG. 6 is a diagrammatic representation of a system 600 for selecting one or more wavelength ranges in accordance with one embodiment of the present invention. As shown, the system 600 includes a broadband source 602 for generating a multiple wavelength incident light beam 604 towards sample 606. A multiple wavelength output beam 608 is scattered from the sample 606 in response to the incident beam 604. The system 600 also includes a filter 610 for selectively passing a portion of the output beam 611 based on wavelength to camera 612. In one implementation, the filter is configurable to pass particular colors, such as red, green, blue, or yellow. The camera is operable to generate an image based on the filtered output beam 611.

Measurements of overlay are taken by moving to a location on the sample where one or more of the targets in a target set are in the field of view of the microscope. An image is acquired and the intensity from some or all of the pixels in the image, which includes each individual target, are averaged or summed to give an intensity value for the target at a particular setting of the filter. In one embodiment, the filter is adjusted so as to give a maximum difference between targets. This may subsequently be normalized with respect to a reference surface, to the number of pixels summed, or corrected by a map of the illumination uniformity within the field of view. The sample or optics may then be moved until all of the necessary targets in a target set are measured. The overlay value is then determined using the intensity values thus determined, as described above, e.g., by: P1=(Ia−Ib) and P2=(Ic−Id) And: Overlay=f0*(P2+P1)/(P2−P1)

This process could be repeated over multiple wavelength ranges to improve accuracy, precision, and robustness, wherein the wavelengths which result in the best contrast are used for the scatterometry analysis.

Because the magnification is low and the field of view is large compared to a typical imaging overlay tool and because an image of the area of the sample is colleted, unlike a conventional reflectometer or ellipsometer, analysis of the image may be used to detect other types of processing problems by analyzing the image. For example, if the wrong reticle has been used for one or more processing steps, the image would be materially different. If the resist thickness were incorrect, the brightness or contrast of the image may be effected. If resist streaking were present, variation of brightness or contrast over the image may be detected. In CMP (chemical mechanical polishing) processes, processing errors such as over-polish, under-polish, etc. could be similarly detected.

In this embodiment, multiple scatterometry targets can be measured simultaneously, increasing the measurement speed. Additionally, processing errors or changes in processing conditions other than overlay can be detected without the need for a separate inspection tool.

Multi-Angle, Simultaneous Scatterometry

Techniques of obtaining scatterometric measurements may include the 2-theta approach, in which scattering intensity from a grating or other repeating structure is measured at a plurality of angles by making multiple, sequential measurements. Another technique of making scatterometric measurements is spectroscopic scatterometry. Use of the 2-theta approach is very slow, since multiple measurements are typically made. Use of spectroscopic scatterometry requires sophisticated expensive optics.

In a specific embodiment of the present invention, techniques and apparatus for simultaneous, multi-angle scatterometry are provided. Unlike the 2-theta approach, measurements are made with an apparatus which permits scattering intensity to be simultaneously determined for many angles. This technique is far faster than the 2-theta approach.

In order to implement this approach, an optical apparatus such as that shown in U.S. Pat. No. 5,166,752 by Spanier et al could be used. This patent is herein incorporated by reference in its entirety. In this patent, a multi-angle ellipsometer is shown in, for example, FIGS. 3 and 4 of the Spanier et al. patent. FIG. 7 is a diagrammatic representation of a simultaneous, multiple angle of incidence ellipsometer 700. As shown, the ellipsometer includes a source generator (e.g., components 702, 706, 708, 710, and 712) for directing polarized light onto the surface of sample 714, detection optics (e.g., components 718 through 724) for handling and detecting the output beam emitted from the sample, and an analyzer 726 for analyzing the polarization state of the light reflected from the sample. The source generator includes a light source 702, a polarizer 708, a compensator 710 with a variable aperture, and a focusing lens system 712 simultaneously directing polarized light from a single beam of light from the light source onto the sample's surface at different angles of incidence. The source generator may also include an optional optical narrow band filter.

The lens system 712 has an effective aperture to focal length ratio for focusing the light on the sample 714 with angles of incidence which vary over a range of angles of at least one or two degrees. In a particular embodiment, the range of angles of incidence is 30 degrees. Larger angles could be employed for directing rays at the sample 714.

The focusing lens system 712 focuses the polarized light which may be from a He—Ne laser for example, down to a single small spot or point on the sample 714. The different incident rays may have widely varying angles of incidence which are focused on a single, small spot on the sample 714. Thus, the light directed on the small spot on sample 714 contains rays at many angles of incidence above and below the angle of incidence of the central ray through the focusing lens. Each one of the incoming rays is reflected at an angle equal to its angle of incidence with the polarization state of each of the rays being altered by that reflection. A detector array 722 is employed to detect a plurality of rays reflected from the sample 714 individually over different, narrow ranges of angles of incidence to simply and quickly obtain data at a plurality of angles of incidence.

The output beam emitted from the sample 714 is directed through output lens 716, interchangeable aperture 718, polarization analyzer 720, and an optional alternate filter 722 onto detector array 722. The diameter d of the lenses 712 and 716 corresponds to their effective diameter. In the illustrated embodiment the lenses 712 and 716 each have a diameter d of 18 mm and a focal length 1 of 34 mm. Other effective lens diameters and focal lengths could be employed so long as a range of angles of incidence, preferably at least 30 degrees, is provided. The lens diameter and focal length are chosen with a view toward maximizing the number of angles of incidence of the light beams which strike the sample 714. In an alternative embodiment, light is transmitted through the sample rather than reflected from a surface of the sample.

The refocusing lens or lenses 712 directs the reflected (transmitted) light toward the detector array 722. However, a refocusing lens need not be employed as the reflected (transmitted) light could be made to directly impinge upon an array of detectors. It is preferable that the lenses 712 and 716 do not themselves alter the polarization state of the light.

The detector array 722 may be a linear, multiple element detector wherein each of the detector elements can detect a narrow range of angles of incidence of the rays that illuminate the sample. In the disclosed embodiment, the array 722 is a solid-state photosensitive detector array wherein the separate detector elements are all integrated on one circuit chip. Particularly, the detector elements comprise a linear array of photodiodes. While integrated on a single circuit chip, the individual photodiodes can function as separate detectors. The linear array of the disclosed embodiment comprises 128 detector elements arranged in a row to provide data for 128 different angles of incidence where the full array is illuminated by the reflected (transmitted) light. The number of individual detector elements could be more or less than that in the disclosed embodiment and the detector elements need not be integrated on a single chip but could be discrete detectors. By using a plurality of detector elements, it is possible to simultaneously detect the light reflected from the surface (or transmitted through the sample) for each of a plurality of different angles of incidence. It is also possible with the invention to employ a smaller number of detector elements which could be sequentially moved to mechanically scan the reflected (transmitted) rays for detection but this technique would require more time and could be less accurate, depending upon positioning accuracy.

The physical size of each of the detector elements is preferably less than the expanse of the reflected rays so that each element detects only a certain narrow range of angles of incidence on the illuminating side. The output of each of the detectors is used in a conventional manner as with real time computer techniques (e.g., via analyzer 724) to generate data in terms of Δ and Ψ for each of those narrow ranges of angles of incidence. The data is then interpreted in a conventional manner. It matters in general which direction the linear array runs; the linear array preferably runs in the plane of the optical system. In the disclosed embodiment, the long axis of the linear detector array 722 lies in the plane of incidence of the central ray and perpendicular to the central ray for detecting the maximum number of incidence angles.

Such an ellipsometer could be used to illuminate a scatterometry target simultaneously over a range of angles, and an intensity of the scattered light is measured over a range of angles simultaneously with an array detector or the like. By collecting data from the intensities measured at those angles, the parameters of the grating or other target can be determined. For example, the data can be compared against theoretical models of data derived from techniques such as those mentioned by U.S. Pat. No. 6,590,656, issued Jul. 8, 2003, entitled “SPECTROSCOPIC SCATTEROMETER SYSTEM” by Xu et al, which patent is herein incorporated by reference in its entirety. The data can be compared to theoretical models derived from techniques such as those mentioned by U.S. patent application, having application Ser. No. 09/833,084, filed 10 Apr. 2001, entitled “PERIODIC PATTERNS AND TECHNIQUE TO CONTROL MISALIGNMENT”, by Abdulhalim, et al., which application is herein incorporated by reference in its entirety.

The data can be pre-generated and stored in libraries, or generated in real time during analysis. It is also possible, for techniques like scatterometric overlay, to directly compare measured spectra associated with various targets. Such differential measurements can then be used to determine overlay misregistration.

It would also be possible to perform this technique with a beam profile reflectometer such as that described in U.S. Pat. No. 4,999,014, issued 12 Mar. 1991, entitled “METHOD AND APPARATUS FOR MEASURING THICKNESS OF THIN FILMS” by Gold et al., which patent is incorporated herein by reference in its entirety.

Simultaneous Ellipsometry and Reflectometry

A system for employing a combination of ellipsometers and reflectometers may be employed to improve the accuracy of scatterometric measurements of overlay. In one embodiment, two or more ellipsometers are utilized as scatterometers to measure overlay. One of more of these ellipsometers could be spectroscopic ellipsometers. In another embodiment, two or more reflectometers are utilized as scatterometers to measure overlay. One of more of these reflectometers could be polarized reflectometers. Alternatively, a combination of one or more ellipsometers and one or more reflectometers are utilized as scatterometers to measure overlay.

Measurements can be performed serially (with each tool performing measurements at different times), in parallel (with all tools performing measurements substantially-simultaneously, or in any other arrangement (e.g., at least two but less than all of the tools performing measurements substantially-simultaneously).

In any of the implementations described herein, various tools may perform measurements at different angles of incidence, including normally and obliquely. In a specific implementation, at least two tools perform scatterometric measurements at substantially the same angle of incidence but from different directions. For instance, a first tool would be used for scatterometric measurements in the x direction, and a second tool would be used for scatterometric measurements in the y direction. Such a system could eliminate certain common scattered signals, with a corresponding increase in accuracy of measurements, and provide a symmetric configuration.

An advantage of employing a combination of such tools in scatterometric determination of overlay is that the accuracy of the measurements could be increased. Another advantage in using more than one tool and performing measurements at more than one angle (or point) of incidence is to help separate effects affecting the medium of interest (e.g., film effects) from overlay. A further advantage is that combinations of ellipsometers and reflectometers already exist in current inspection tools. Another advantage of employing a combination of scatterometers configured to perform scatterometry measurements substantially in parallel on different targets or different target sections could be to reduce the total time required for measurement. Another advantage of a parallel measurement system could be to increase the signal acquisition time for each scatterometry overlay target and improve the measurement precision.

Scatterometric Overlay Determination Using FT processing

A system for scatterometric measurement of overlay using fourier transform (FT) processing may also be utilized. In one embodiment, an interferometer is employed to modulate substantially all wavelengths of a broadband source, and the scattered radiation is detected with a CCD camera. Substantially all wavelengths of the modulation band are recorded for each pixel, or for groups of pixels. As the interferometer steps through the modulation band, a spectroscopic image of the scattered signal is produced.

The resulting spectroscopic image may have a relatively large field of view. For example, the image may include several multiple targets. The spectroscopic image could be processed on a pixel-by-pixel basis to accurately determine overlay while eliminating extraneous effects (e.g., film effects). Alternatively, processing could be performed using groups of pixels to improve speed and decrease processing resources. For example, a group of pixels from each target may be analyzed in accordance with the above described scatterometry process. In one implementation, the images for each corresponding pair of targets is subtracted to obtain difference images D1 and D2. A characteristic, such as average intensity, of each difference signal is then obtained to result in P1 and P2, which are then used to determine the overlay error.

In a specific implementation, a Michelson interferometer is used to step through a wavelength modulation band. Alternatively, a Linnik interferometer, or any other interferometer, could be employed. For each position of the mirror, a CCD camera records the scattered signal intercepted in the field of view of the camera. The detected signals may then be digitized and stored on a pixel-by-pixel basis, or as groups of pixels. The magnitude of the steps is generally proportional with the accuracy of the overlay measurement. The speed of the camera (e.g., the number of fields per second that the camera can capture) is typically proportional with the speed of the measurement. Once the modulation band is spanned, the signal recorded for each pixel (or group of pixels) may be used as a basis for a discrete fourier transformation (or DFT). The DFT provides a spectral profile for each pixel (or group of pixels). This spectral profile for each target may then be used in the scatterometry overlay techniques described in the previous paragraph. Overlay determination can then be performed with increased accuracy.

Multiple Tunable Lasers

A system which has a combination of tunable lasers may be utilized to improve the accuracy of scatterometric measurements of overlay in combination with measurements performed by various configurations of ellipsometers and reflectometers. The tunable lasers provide radiation incident on the surface of interest. In one embodiment, scatterometric overlay measurements are performed using targets disposed in at least one layer of the design under consideration, and the tunable lasers provide radiation beams incident on the targets.

The measured signals may then be averaged together before or after processing. In one embodiment, measured radiation beams are obtained from targets A, B, C, and D. Two difference signals D1 an D2 from each pair of targets may then be obtained at multiple tunable laser settings. The signals measured from each target for each tunable laser setting may be averaged together prior to obtaining the difference signals D1 and D2. Alternatively, each set of differences signals for D1 and D2 may be averaged together to obtain a single average difference signal D1 and D2. Properties P1 and P2 of the difference signals D1 and D2 (e.g., integration) may then be obtained. In an alternative embodiment, multiple properties P1 and P2 are obtained for the different configurations of the tunable laser (without averaging the measured signals or the difference signals D1 and D2) and the results are averaged for each signal P1 and P2. The overlay error may then be obtained based on the signals P1 and P2 as described above.

Scatterometric Overlay Determination Using Spatial Filtering

One embodiment expands on the above described embodiment for Scatterometric Overlay Determination using FT Processing.

A system for scatterometric measurement of overlay using FT processing in connection with spatial filtering is provided. More particularly, the signal reflected by at least one scatterometry target is selectively filtered spatially to only process particular signal components.

In the above described embodiment for Scatterometric Overlay Determination using FT Processing, an interferometer is employed to modulate substantially all wavelengths of a broadband source, and the scattered radiation is detected with a detector, such as a CCD camera. Substantially all wavelengths may then be recorded for each pixel, or for groups of pixels. As the interferometer steps through the modulation band, a spectroscopic image of the scattered signal is produced. In the present example, where the scattered signal corresponding to a complete image (or a portion of an image) is collected, only a portion of the signal corresponding to a single line of pixels is retained. Alternatively, a portion of the signal corresponding to a plurality of pixel lines, but less than the whole image, is collected. Such a selective collection of the scattered signal can be achieved by spatially filtering the signal to only retain horizontal, vertical or oblique stripes of the signal corresponding to rows of pixels in the detector or CCD camera. Alternatively, a larger, more complete portion of the scattered signal could be collected at the CCD camera, but the information corresponding to undesirable rows of pixels (e.g., an edge of a target or a border between two targets) may be discarded subsequent to the collection.

The spectroscopic image corresponding to the retained signal may then be processed on a pixel-by-pixel basis to accurately determine overlay while eliminating extraneous effects (e.g., film effects). Alternatively, processing could be performed using groups of pixels to improve speed and decrease processing resources. This embodiment provides higher SNR over conventional processing methods.

In one implementation of the invention, the above described techniques to determine overlay in reference to the Scatterometric Overlay Determination using FT Processing embodiment may be used.

Compared to the embodiment for Scatterometric Overlay Determination using FT Processing, aspects of the embodiment for Scatterometric Overlay Determination Using Spatial Filtering may improve the processing speed and the throughput, while decreasing processing resources.

Examples of Spectroscopic Ellipsometers and Spectroscopic Reflectometers

FIG. 8 is a schematic view of a spectroscopic scatterometer system 800, in accordance with one embodiment of the present invention. The system 800 combines the features of a spectroscopic ellipsometer 802 and spectroscopic reflectometer 804, each of which may be used for measuring overlay of a grating structure 806 disposed on a substrate or wafer 808. The grating structure 806, which is shown in a somewhat simplified format in the Figure, may be widely varied. The grating structure 806 may, for example, correspond to any of those grating structures described herein. Both the spectroscopic ellipsometer 802 and spectroscopic reflectometer 804 may utilize a stage 810, which is used for moving the substrate 808 in the horizontal xy directions as well as the vertical z direction. The stage may also rotate or tilt the substrate. In operation, the stage 810 moves the substrate 808 so that the grating structure 806 can be measured by the spectroscopic ellipsometer 802 and/or the spectroscopic reflectometer 804.

The spectroscopic ellipsometer 802 and spectroscopic reflectometer 804 also utilize one or more broadband radiation sources 812. By way of example, the light source 812 may supply electromagnetic radiation having wavelengths in the range of at least 230 to 800 nm. Examples of broadband light sources include deuterium discharge lamps, xenon arc lamps, tungsten filament lamps, quartz halogen lamps. Alternatively, one or more laser radiation sources may be used instead of or in combination with the broadband light source.

In the spectroscopic reflectometer 804, a lens 814 collects and directs radiation from source 812 to a beam splitter 816, which reflects part of the incoming beam towards the focus lens 818, which focuses the radiation onto the substrate 808 in the vicinity of the grating structure 806. The light reflected by the substrate 808 is collected by the lens 818, passes through the beam splitter 816 to a spectrometer 820.

The spectral components are detected and signals representing such components are supplied to the computer 822, which computes the overlay in a manner described above.

In the spectroscopic ellipsometer 802, the light source 812 supplies light through a fiber optic cable 824, which randomizes the polarization and creates a uniform light source for illuminating the substrate 808. Upon emerging from the fiber 824, the radiation passes through an optical illuminator 826 that may include a slit aperture and a focus lens (not shown). The light emerging from the illuminator 826 is polarized by a polarizer 828 to produce a polarized sampling beam 830 illuminating the substrate 808. The radiation emerging from the sampling beam 830 reflects off of the substrate 808 and passes through an analyzer 832 to a spectrometer 834. The spectral components of the reflected radiation are detected and signals representing such components are supplied to the computer 822, which computes the overlay in a manner described above.

In the spectroscopic ellipsometer 802, either the polarizer 828 or the analyzer 832 or both may include a waveplate, also known as compensator or retarder (not shown). The waveplate changes the relative phase between two polarizations so as to change linearly polarized light to elliptically polarized light or vice versa.

In order to collect more information about the interaction of the incident polarized light 830 with the sample, it is desirable to modulate the polarization state of the light or modulate the polarization sensitivity of the analyzer or both. Typically this is done by rotating an optical element within the polarizer and/or analyzer. A polarizing element within the polarizer or analyzer may be rotated, or, if at least one of those assemblies contains a waveplate, the waveplate may be rotated. The rotation may be controlled by the computer 822 in a manner known to those skilled in the art. Although the use of a rotating element may work well, it may limit the system 802. As should be appreciated, the use of rotating elements may be slow, and because there are moving parts they tend to be less reliable.

In accordance with one embodiment, therefore, the polarizer 828 is configured to include a polarization modulator 836, such as photoelastic modulator (PEM), in order to produce a fast and reliable spectroscopic ellipsometer. The polarization modulator replaces the rotating waveplate. The polarization modulator 836 is an optical element that performs the same function as a rotating waveplate, but without the costly speed and reliability problems. The polarization modulator 836 allows electrical modulation of the phase of the light without mechanically rotating any optical components. Modulation frequencies as high as 100 kHz are readily attainable.

In an alternative embodiment, the analyzer 832 is configured to include a polarization modulator such as a PEM (Photoelastic Modulator) that can be modulated electrically. In yet another embodiment, both the polarizer and analyzer contain polarization modulators, such as PEMs, that are modulated at different frequencies.

Because the polarization modulator 836 can modulate at such a high frequency, the polarization modulator 836 may be used to perform various techniques, which would otherwise be too slow. For example, the difference between the polarized reflectivity of two structures may be obtained. To do this, a PEM may be combined with an acoustic optical modulator (AOM), where the AOM rapidly moves between the two structures while modulating the polarization state at a different (but related, such as multiple or submultiple) frequency. Signals at the sum and the difference of the PEM and AOM modulation frequencies contain useful information and can be detected with high signal-to-noise by synchronous detection. Alternatively the AOM on the incident beam could be used in combination with a PEM in the analyzer.

Although not shown, the rotating waveplate may also be replaced by a polarization modulator in other types of scatterometric systems as for example a polarization sensitive reflectometer.

Scatterometric Overlay Database

One aspect of the present invention provides a database of scatterometric overlay information that may be utilized for scatterometic overlay determination.

In one implementation, one or more database are provided which include one or more libraries of overlay information. The database information is then used in overlay measurements.

In one implementation, the libraries are compiled using predetermined test patterns with artificially-induced overlay. Alternatively, the libraries are produced using layer misregistrations programmed into the stepper. In another embodiment, the overlay that is induced or programmed has a progressive characteristic, varying within a particular range.

The information stored into the database may include overlay data regarding the actual overlay printed on the wafer, as induced via the test pattern or by the stepper. Alternatively, this information is obtained from overlay actually measured on samples. The database may further store scatterometry measurement records associated with the overlay data. Such scatterometry measurement records may be obtained by performing actual scatterometric measurements of the overlay data. The database may also include information regarding materials, process conditions, optical parameters, and other relevant data. The database information may be further enhanced by interpolation and other preprocessing.

The scatterometry database information may be utilized to improve the accuracy and speed of overlay measurements by retrieving scatterometry data associated with particular scatterometric measurements and process conditions recorded during actual measurements. Dynamic selection of a measurement algorithm or methods may also be provided based on database lookups. A further implementation utilizes the database to calibrate scatterometric overlay measurement tools before or during production line measurements.

Alternative Systems for Performing Scatterometry

According to an embodiment of the invention, acquisition of the spectra A through D (and of additional spectra if present) is performed using an optical apparatus that may comprise any of the following or any combination of the following apparatus: an imaging reflectometer, an imaging spectroscopic reflectometer, a polarized spectroscopic imaging reflectometer, a scanning reflectometer system, a system with two or more reflectometers capable of parallel data acquisition, a system with two or more spectroscopic reflectometers capable of parallel data acquisition, a system with two or more polarized spectroscopic reflectometers capable of parallel data acquisition, a system with two or more polarized spectroscopic reflectometers capable of serial data acquisition without moving the wafer stage or moving any optical elements or the reflectometer stage, imaging spectrometers, imaging system with wavelength filter, imaging system with long-pass wavelength filter, imaging system with short-pass wavelength filter, imaging system without wavelength filter, interferometric imaging system (e.g. Linnik microscope, e.g. Linnik microscope as implemented in the KLA-Tencor overlay measurements tools models 5100, 5200, 5300, Archer10, etc. available from KLA-Tencor of San Jose, Calif.), imaging ellipsometer, imaging spectroscopic ellipsometer, a scanning ellipsometer system, a system with two or more ellipsometers capable of parallel data acquisition, a system with two or more ellipsometers capable of serial data acquisition without moving the wafer stage or moving any optical elements or the ellipsometer stage, a Michelson interfereometer, a Mach-Zehnder interferometer, or a Sagnac interferometer.

Several embodiments of an interferometer based imaging spectrometer, as well as other types of imaging spectrometers such as filter based or the “push broom” approach, are described in U.S. Patent, having U.S. Pat. No. 5,835,214, issued 10 Nov. 1998, entitled “METHOD AND APPARATUS FOR SPECTRAL ANALYSIS OF IMAGES”, by Cabib et al. System and Method embodiments for film thickness mapping with spectral imaging are described in U.S. Patent, having U.S. Pat. No. 5,856,871, issued 5 Jan. 1999, entitled “FILM THICKNESS MAPPING USING INTERFEROMETRIC SPECTRAL IMAGING”, by Cabib et al. An alternative architecture for spectral imaging based on LED illumination is described in U.S. Patent, having U.S. Pat. No. 6,142,629, issued 7 Nov. 2000, entitled “SPECTRAL IMAGING USING ILLUMINATION OF PRESELECTED SPECTRAL CONTENT”, by Adel et al. These patents are incorporated herein by reference in their entirety for all purposes.

The imaging spectrometer or reflectometer used for acquisition of the spectra A through D from the four targets (and of additional spectra if present) according to an embodiment of the invention may be of the Fourier transform imaging spectrometer type as is well understood by those skilled in the art. The imaging system of the Fourier transform imaging spectrometer should be capable of separating (resolving) the reflected or scattered light signals from the different targets (or sections of a compound scatterometry overlay target). Alternatively the imaging spectrometer or reflectometer used for acquisition of scatterometry overlay signals may use a two-dimension detector where one axis contains the spatial information from the different scatterometry overlay targets (or sections of a compound scatterometry overlay target) and the other detector axis contains spectrally resolved information from light spectroscopically separated with a prism system or diffraction grating system, for example or a system that is a combination of a prism and a grating. The illumination radiation may be wavelength selected prior to incidence on the target.

The spectra A through D obtained from the four targets (and additional spectra if present) detected in the imaging spectrometers, imaging reflectometers, or any of the other optical systems identified above in connection with various embodiments of the present invention may be unpolarized or selectively polarized. One or more of the unpolarized light or one or more of the polarization components of the reflected or scattered light from the targets may be detected with the imaging spectrometer or the imaging reflectometer.

In various implementations, separate detection systems may be used to separately or simultaneously record one or more of the following light signals: unpolarized reflected light, polarized light with the electric field substantially parallel to one major symmetry axis of one layer of the scatterometry overlay targets, polarized light with the electric field substantially perpendicular to one major symmetry axis of one layer of the scatterometry overlay targets, polarized light with the electric field at an angle to one major symmetry axis of one layer of the scatterometry overlay targets, right-hand circularly polarized radiation, left-hand circularly polarized radiation, and/or a combination of two or more of the previously listed polarization states. A separate detector system may be used to simultaneously record the signal from part of the light source for the purposes of light noise monitoring, and/or light level control, and/or light noise subtraction or normalization.

Various possible implementations of various embodiments of the present invention are illustrated in co-pending U.S. Provisional Application No. 60/449,496, filed 22 Feb. 2003, entitled METHOD AND SYSTEM FOR DETERMINING OVERLAY ERROR BASED ON SCATTEROMETRY SIGNALS ACQUIRED FROM MULTIPLE OVERLAY MEASUREMENT PATTERNS, by Walter D. Mieher et al. This provisional application is herein incorporated by reference in its entirety.

In one embodiment, each of the four targets is illuminated by radiation produced by an optical system. The optical system may take the form of, among others, an optical source, a lensing system, a focusing system, a beam shaping system, and/or a directing system. In one embodiment, the radiation illuminating at least one of the targets is shaped as a radiation beam, with a relatively narrow beam cross section. In a particular implementation, the beam is a laser beam. The radiation illuminating the targets interacts with structures comprised within the targets and produces diffracted radiation components corresponding to each target and denoted as S_(A), S_(B), S_(C), and S_(D). In one embodiment, the illuminating beam is a broadband polarized beam comprising a broad spectral range as is commonly used in spectroscopic ellipsometry.

Scatterometry Overlay Technique Alternatives:

Several related techniques are described in the above related co-pending U.S. Provisional Applications. These related techniques may be easily integrated with the techniques described herein.

In one embodiment of the invention, the targets (or compound scatterometry target sections) with different programmed offsets +/−F and +/−f0 as described above, or +/−F or other similar target combinations, are grouped together to enable simultaneous signal acquisition. In one implementation targets are arranged in a line to enable data acquisition while scanning the wafer or some or all of the optics in one direction along the array of scatterometry overlay targets. Arranging the targets in a linear array may also enable use of an imaging spectrometer or reflectometer, where one detector axis separates the signals from the different targets (or target sections) and the other detector axis detects the spectral information. In this case the imaging system images a linear or cylindrical image of the linear target array into the prism or grating system. The imaging spectrometer or imaging reflectometer may contain an array of two or more lenses (known to those skilled in the art as a lenslet array) to separate and direct the reflected or scattered light from different targets or target sections.

In one embodiment, the primary offset F is optimized to provide larger or maximum sensitivity to overlay errors. For instance, an offset F equal to ¼ of the pitch of the target provides high overlay sensitivity since it is half-way in-between the two symmetry points where overlay error sensitivity is minimum. The secondary offset f0 may be chosen such that the f0 is outside the region of interest for overlay measurements, such as equal to or beyond the specification limits, but it should not cause the uncertainty of the overlay measurement to allow the error that an out-of-spec measurement can be interpreted as within specifications. Nevertheless, this is not a limitation on the range of f0. A large f0 may decrease the accuracy of the overlay measurements for overlay errors E between −f0 and +f0. For overlay errors E larger than |f0|, the accuracy of the overlay measurement may be reduced due to extrapolation beyond the region −f0 to +f0 and the accuracy of the linear approximation may also be reduced.

Overlay measurements are most commonly done at or near the four corners of the stepper field (sometimes with an additional measurement near the center of the field) in 5 to 25 fields per wafer in semiconductor manufacturing processes. For a system of four targets used to determine overlay in the x direction and four targets used to determine overlay in the y direction, according to an embodiment of the present invention, a total of 8*4*5=160 measurements of scatterometry overlay targets may be used to determine the two dimensional overlay for a common overlay measurement sampling plan. More measurements may be conducted for more detailed sampling plans.

According to another embodiment of the invention, a total of six targets (three for x and three for y, for example) can be used to determine two dimensional overlay for the sample. This may facilitate further simplification of the overlay metrology process, reduction in processing resources, and decrease of the time used in the metrology process. In yet other implementations, additional targets or additional pairs of targets may be produced on the sample and used in a substantially similar manner with that described herein for determination of overlay based on scatterometry, but adjusted for the increased number of targets and corresponding number of diffracted radiation components. The mathematical methods for determination of the overlay error E can be similarly adjusted to exploit the availability of increased information provided by such additional targets or additional pairs, including by possibly accounting for higher order approximation terms in the formula for the overlay error E.

Scatterometric Overlay Determination With Limited Refocusing

To improve the accuracy of scatterometry overlay determination, more than one measurement is preferably carried out. One implementation utilizes a plurality of scatterometry overlay targets, and for each target, the system makes one scatterometric measurement of overlay. Another implementation utilizes a single scatterometric target, or a single scatterometric target area that comprises multiple target sub-regions, and more than one scatterometric overlay measurement is performed for that target or target area. In yet another embodiment, a plurality of targets or target regions are used, and more than one measurement is performed for some or all of the targets or target regions.

Conventionally, the optical system is refocused for each individual measurement. This, however, can consume a lot of time thus decreasing the processing speed of the system. For example, each focus sequence may take between 0.1 and 1 seconds, and each wafer may include between 30 to 70 sites with each site consisting of 8 targets. Using these numbers, refocusing may take up to as much as 560 seconds for each wafer. Considering there are typically 100 s and 1000 s of wafers to be inspected this number may be further increased to a completely unacceptable level.

In accordance with one embodiment of the present invention, therefore, multiple scatterometry overlay measurements are performed with limited optical refocusing in order to increase the processing speed and throughput of the system. By limited optical refocusing, it is generally meant that at least some new measurements are performed without refocusing the optical system, i.e., multiple measurements are made with the same focus setting. For instance, the optical system may be initialized with a focus setting that is optimized for a plurality of scatterometric measurements that will be performed, and no further refocusing takes place during these individual scatterometric measurements. The optimized focus setting may be found once for the entire wafer, or it may be found periodically. When periodic, the focus setting may be established at preset increments of time during inspection (e.g., every 30 seconds), for a particular location on the wafer (e.g., every 2×2 cm2 of wafer), for a particular characteristic of the target (e.g., similar line widths and spacing) and the like.

In one embodiment, the wafer includes a plurality of focus zones. Each of the focus zones is initialized with a focus setting that is optimized for all of scatterometric measurements that will be performed within the focus zone. Refocusing does not occur between individual scatterometric measurements inside the focus zone. As such, each target within the focus zone is measured with the same optimized focus setting. Any number of focus zones may be used.

The configuration of the focus zones may be widely varied. In one implementation, the focus zones correspond to a portion of the wafer. By way of example, the wafer may be broken up into plurality of radial focus zones emanating at the center of the wafer and working outwards, or into a plurality of angular focus zones, which separate the wafer into multiple quadrants. In another implementation, the focus zones correspond to a particular set of targets as for example, the targets at the corners of each semiconductor device. In another implementation, the focus zone corresponds to a particular target area that includes a plurality of targets (see for example 9A). In another implementation, the focus zones correspond to a particular target sub-region within the target areas (as for example the x or y directed group of targets shown in FIG. 9B). In yet another implementation, the focus zone corresponds to a particular sub region within the target itself.

A method of determining overlay will now be described. The method generally includes optimizing the focus setting of a first zone. The method also includes performing a first set of measurements on a plurality of targets within the first zone. Each of the targets within the first zone is measured using the optimized focus setting of the first zone. That is, a first target is measured, and thereafter a second target is measured without refocusing the optical system. Any number of targets can be measured in this manner. The method further includes optimizing the focus setting of a second zone. The method additionally includes performing a second set of measurements on a plurality of targets within the second zone. Each of the targets within the second zone is measured using the optimized focus setting of the second zone. That is, a first target is measured, and thereafter a second target is measured without refocusing the optical system. Any number of targets can be measured in this manner.

In one example of this method, the first and second zones may represent different target areas that include a plurality of targets (See FIG. 9A). In this example, each of the targets are located in close proximity to one another and therefore it can be assumed that variations in focus from one target to the next are minimal. The method generally includes optimizing the focus setting in the target area, and thereafter measuring each of the targets in the target area with the optimized focus setting. For example, the first target is measured, thereafter the adjacent target is measured, and so on without ever refocusing the optical system. When a first target area is measured, the system may repeat these steps on a second target area, as for example, a target area located at a different corner of the device.

In another example of this method, the first and second zones may represent sub regions with a target area that includes a plurality of targets. The sub regions may for example represent different target orientations (See FIG. 9B). The method generally includes optimizing the focus setting in the first sub regions (e.g., targets along the x axis), and thereafter measuring each of the targets in the sub region with the optimized focus setting. For example, the first target is measured, thereafter the adjacent target is measured, and so on without ever refocusing the optical system. When the first sub region is measured, the method continues by optimizing the focus setting in the second sub regions (e.g., targets along the y axis), and thereafter measuring each of the targets in the sub region with the optimized focus setting. For example, the first target is measured, thereafter the adjacent target is measured, and so on without ever refocusing the optical system. In another example the system is refocused prior to the measurement on the first scatterometry overlay target in an xy scatterometry overlay target group. After the scatterometry signal is measured for the first target in the xy overlay target group, the rest of the targets may be measured without refocusing. For example, an xy overlay target group comprises four scatterometry overlay targets for an overlay error determination in the x direction and four scatterometry overlay targets for an overlay error determination in the y direction.

Scatterometric Overlay Determination Using a Line Image

A system for scatterometric measurement of overlay using a line image may also be implemented.

In one embodiment, a scatterometric target is illuminated along a single incidence line. The scattered radiation is intercepted by a dispersive system, such as a prism or diffraction grating. The scattered radiation is thereby dispersed as a function of wavelength. The dispersed radiation is then captured by a detector, such as a CCD camera. If the camera is properly aligned, the radiation entering the field of view has a two dimensional profile with points along the incidence line distributed along the X-axis of the field of view, and various wavelengths dispersed along the Y-axis. An example incidence line and field of view are illustrated in FIG. 10.

The image captured by the camera can then be processed at pixel level to determine overlay, possibly using the FT approach disclosed herein. Once overlay is measured along a particular incidence line, the wafer could be rotated by 90 degrees (or by any arbitrary angle) to measure overlay in a different direction. An advantage of the present invention is that overlay may be measured in more than one direction using a single optical system.

An alternative to illuminating a single incident line is illuminating a larger area but only capturing scattered radiation along a detection line. The description provided above also applies to this embodiment, with appropriate modifications.

Algorithms

Various algorithms and methods may be employed for more efficiently and accurately measuring overlay based on scatterometry.

Old methods of performing such calculations use a model- based method, or a differential method for calculating overlay. These conventional methods lack the accuracy that may be achieved by combining multiple algorithms for refining and cross checking results. Also, these methods do not make good use of pre-existing information (like CD or profile data).

In one general algorithm implementation, overlay is determined using data from a plurality of separate calculations of different product parameters.

In a first embodiment, a first calculation of overlay is performed according to a first technique (such as the differential method). A second calculation of overlay is then performed according to a second technique (such as a model-based regression). The results are then combined from the two calculations. The results may be combined in various ways. For example, one calculation may be used to cross check another. Or one calculation may be used to provide initial values to speed up the other calculation. Other combinations may also be used.

In a second embodiment, the speed and/or accuracy of an overlay measurement are enhanced by making use of other measured data. For example, film thickness data from the layers making up the target may be fed into the algorithm. Such film thickness data could be measured using an appropriate tool, such as an ellipsometer or reflectometer. Alternatively (or additionally), CD data could be provided from an SCD measurement (scatterometry critical dimension or scatterometry profile measurement) and used to speed up or improve the accuracy of the scatterometry calculations. Other data from a scatterometry profile measurement could be similarly used, such as height or three dimensional profile information. Other sources of CD data, like a CD SEM, could be used.

Combined Scatterometry and Imaging Targets

In an alternative implementation, the targets are designed for an imaging based overlay metrology application, as well as for the above described scatterometry analysis. In other words, the scatterometry and imaging target structures are tightly integrated so that scatterometry may be performed in conjunction with an image based overlay measurement. Preferably, the scatterometry target pairs are symmetrically positioned about the center of the field of view. If symmetry is preserved in the illumination and collection channels of the imaging system, tool induced shift and will be minimized. By example, Xa and Xa′ are twin (have similar magnitude but opposite sign offset) targets in the x direction. (Here Xa and Xa′ may correspond to the targets Xa and Xd in FIG. 1). Likewise, Xb and Xb′ are opposites. (Here Xb and Xb′ may correspond to the targets Xb and Xc in FIG. 1). In the y direction, targets Ya and Ya′ are opposites, while Yb and Yb′ are opposites.

FIG. 11 a is a top view representation of a first combination imaging and scatterometry target embodiment. In this example, the target arrangement includes a set of four x direction targets for determining overlay using scatterometry and a set of four y direction targets for determining overlay using scatterometry. The targets are laid out so that adjacent targets (with respect to the overlay measurement direction) have an opposite offset. In the illustrated example, target Xa has an opposite offset than target Xa′, and target Xb has an opposite offset than target Xb′. Likewise, targets Ya and Ya′ have opposite offsets, and targets Yb and Yb′ have opposite offsets. In this example, the targets also include structures which can be used for imaged based overlay determination.

In the illustrated example, the target arrangement includes a black border structure 1104 on a first layer and a gray cross-shaped structure 1102 on a second layer. Using image analysis methods, the center of the black structure 1104 may then be compared with the center of the gray structure 1102 to determine an overlay error (if any).

Although this set of targets have an overall rectangular shape which extends longer in the x direction than the y direction, of course, the targets could have other shapes (e.g., square or any symmetrical polygon) and/or extend longer in a direction other than x.

In other combinational target arrangements, the imaging structures are laid out in the center of a symmetrically arranged set of scatterometry targets. FIG. 11 b is a top view representation of a second combination imaging and scatterometry target embodiment. As shown, scatterometry targets are symmetrically arranged around a central image type target 152. In this example, the image type target 1152 is formed from quadrants of line segments, where each quadrant is either in the x or y direction. Suitable image type targets and techniques for determining overlay with same are described in the following U.S. patents and applications: (1) U.S. Pat. No. 6,462,818, issued 8 Oct. 2002, entitled “OVERLAY ALIGNMENT MARK DESIGN”, by Bareket, (2) U.S. Pat. No. 6,023,338, issued 8 Feb. 2000, entitled “OVERLAY ALIGNMENT MEASUREMENT OF WAFER”, by Bareket, (3) application Ser. No. 09/894,987, filed 27 Jun. 2001, entitled “OVERLAY MARKS, METHODS OF OVERLAY MARK DESIGN AND METHODS OF OVERLAY MEASUREMENTS”, by Ghinovker et al., and (4) U.S. Pat. No. 6,486,954, issued 26 Nov. 2002, entitled “OVERLAY ALIGNMENT MEASUREMENT MARK” by Levy et al. These patents and applications are all incorporated herein by reference in their entirety.

FIG. 11 c is a top view representation of a third combination imaging and scatterometry target embodiment. This target arrangement has scatterometry target symmetrically arranged around a box-in-box type target 1154. A box-in-box target generally includes a first inner box formed from a first layer surrounded by a second outer box structure formed in a second layer. The centers of the inner box structures may be compared to the center of the outer box structures to determine overlay error (if present).

The above targets may be imaged in any suitable manner (e.g., as described in the above referenced patents and applications by Bareket, Ghinovker et al., and Levy et al.) to determine overlay. The target arrangements may also simultaneously or sequentially measured with any desirable optical tool as described herein to determine overlay using scatterometry techniques. In an alternative embodiment, the scatterometry targets may be simultaneously imaged along with the imaging type target structures. The resulting image may be subdivided into the separate scatterometry targets and then the scatterometry techniques applied to the image signals for each target (e.g., intensity). The image may be obtained at the same time as, or before or after the scatterometry overlay measurements. The imaging system may be a high-resolution microscope such as the system in the KLA-Tencor 5300 or Archer overlay measurement systems available from KLA-Tencor of San Jose, Calif. Alternatively, the imaging system may be a lower resolution imaging system used for other purposes that may include wafer alignment or pattern recognition.

Mask Alignment During Imprint Lithography

Because the mask and sample are typically in close proximity (separated by the fluid to be polymerized) during nano-imprint lithography, the patterned surface of the mask, the fluid, and the patterned sample to be aligned to can be considered to be functionally equivalent to a scatterometry overlay target. All of the methods, techniques and targets defined for scatterometry overly would then be applicable to alignment procedures.

In one embodiment, the measurement instrument would project radiation (preferably light) through the mask and onto an area of the mask and wafer which contains one or more scatterometry overlay targets. The change in properties of the reflected light due to scattering or diffraction may then be used to determine the offset between the pattern on the mask and the pattern on the wafer. The wafer is then moved relative to the mask (or vice versa) to achieve the desired offset. A more accurate alignment may the be achieved, rather than with conventional alignment techniques such as direct imaging or moire techniques. The instrument could be a reflectometer, polarized reflectometer, spectrometer, imaging reflectometer, imaging interferometer, or other such instrument as described herein or in the above referenced provisional applications.

Disposition of Scatterometry Overlay Targets

The accuracy of scatterometry overlay systems can be improved by taking measurements at multiple targets located across the surface of interest. In one implementation, the scatterometry overlay system may utilize a plurality of scatterometry targets at various locations across the surface of interest and for each target the system may make one scatterometric measurement of overlay. In another implementation, the scatterometry overlay system may utilize a plurality of scatterometry target areas at various locations across the surface of interest. The scatterometry target areas comprise multiple targets, each of which can be measured by the scatterometry overlay system. By way of example, the scatterometry targets or scatterometric target areas may be located at the corners of one or more devices being formed on a wafer. In addition, the scatterometry targets may generally include a grating structure, which is measurable by the scatterometry overlay system.

The number of targets generally depends on the available space on the surface of interest. In most cases, the targets are placed in the scribe line between devices on a wafer. The scribe line is the place on the wafer where the wafer is separated into dies via sawing or dicing and thus the circuit itself is not patterned there. In cases such as this, the number of targets may be limited, at least in part, by the narrowness of the scribeline. As should be appreciated, the scribe lines tend to be narrow so as to maximize the amount of devices on the wafer.

In accordance with one embodiment of the present invention, the targets are strategically placed on the surface of interest in order to overcome any space constraints while increasing the number of targets. In one implementation, at least two targets are placed substantially collinearly in a first direction. For example, they may be placed collinearly in the x-direction or the y-direction. This arrangement may be useful when confronted with narrow spaces as for example scribe lines. In another implementation, multiple targets are disposed collinearly in multiple directions. For example, multiple targets may be disposed collinearly in both the x direction and the y-direction. This arrangement may be useful at the corner of a device as for example at the intersection of two scribe lines.

Although the examples given are directed at a Cartesian coordinate system as defined on the surface of interest, it should be noted that the coordinate system may be oriented arbitrarily on the surface or interest with the x and y axis being rotated or possibly interchanged. Alternatively or in combination with the Cartesian coordinate system, any other coordinate system may be used such as for example, a polar coordinate system.

FIG. 9A is a top view diagram of a scatterometric target area 900 having one or more targets 902, in accordance with one embodiment of the present invention. The scatterometric targets 902 are generally provided to determine the relative shift between two or more successive layers of a substrate or between two or more separately generated patterns on a single layer of a substrate. By way of example, the scatterometric targets may be used to determine how accurately a first layer aligns with respect to a second layer disposed above or below it or how accurately a first pattern aligns relative to a preceding or succeeding second pattern disposed on the same layer.

As shown in FIG. 9A, the scatterometric target area 900 includes at least two substantially collinear targets 902. By collinear, it is generally meant that the centers of symmetry for each of the targets 902 lie on the same axis 904. By way of example, the axis 904 may be aligned with a conventional coordinate system (Cartesian, polar, etc.) or some variation thereof. By placing the targets 902 collinearly, the scatterometric target area 900 does not take up as much width W and therefore may be placed in constrained places as for example in the scribeline of the wafer.

The targets 902 are generally juxtaposed relative to one another along the axis 904. In most cases, the juxtaposed targets 902 are spatially separated from one another so that they do not overlap portions of an adjacent target 902. Each of the targets 902 is therefore distinct, i.e., represents a different area on the substrate. This is typically done to ensure that each of the targets 902 is properly measured. The space 906 between targets 902 produces distortions in the optical signal and therefore it is excluded from the overlay calculation. The size of the space 906 is typically balanced with the size of the targets 902 so as to provide as much information as possible for the measurement of overlay. That is, it is generally desired to have larger targets 902 and smaller spaces 906 there between. The space 906 between targets 902 may be referred to as an exclusion zone.

The targets 902 may be widely varied, and may generally correspond to any of those overlay targets that can be measured via scatterometry. By way of example, the targets 902 may generally include one or more grating structures 908 having parallel segmented lines 910. Although not a requirement, the segmented lines 910 for the collinear targets 902 are generally positioned in the same direction, which may be parallel or transverse to the axis 904. In most cases, some of the segmented lines 910 are perpendicular to the axis 904 and some are parallel to the axis 904 to enable overlay measurements in x and y. Furthermore, the targets 902 may have an identical configuration or they may have a different configuration. Configuration may for example include the overall shape and size of the target 902 or perhaps the line width and spacing of the segmented lines 910 associated with the grating structure 908 contained within the target 902. Preferably the targets used for the overlay measurement in a particular direction, for example x direction, are designed to have the same configuration except for the programmed or designed overlay offsets.

The number of targets may also be widely varied. As should be appreciated, increasing the number of targets, increases the number of data collection points and therefore the accuracy of the measurement. The number of targets 902 generally depends on the overall size of the targets 902 and the space constraints in the direction of the axis 904. In the illustrated embodiment, eight side by side targets 902 are positioned within the scatterometric target area 900. A scatterometric target area may be equivalent to an xy scatterometry overlay target group as discussed above.

Using the above mentioned targets 902, scatterometric overlay measurements may be made sequentially, one target at a time, to measure overlay while eliminating effects due to variations in other sample parameters, such as film thickness. This can be accomplished via continuously scanning of the scatterometric target area (including for example the targets and the spaces there between) or by stepping to each of the targets. Alternatively, measurements may take place substantially simultaneously using two or more scatterometry signal beams for two, more than two, or all targets to increase throughput. The multiple scatterometry signal beams may come from more than one substantially independent scatterometry optical systems, or they may share much of the optical system, for example they may share the same light source, the same beam directing optics, or the same detector system.

Although the method described above includes placing the centers of symmetry for each of the targets substantially collinear, it should be noted that the centers of symmetry may be offset from the axis so long as a measurable portion of the targets still falls on the same axis.

Furthermore, although the method described above includes placing targets of similar orientation along the same axis, it should be noted that some of the targets may be positioned with a different orientation. For example, a first group of the targets 902 may have segmented lines positioned in the x dimension while a second group of the targets 902 may have segmented lines position in the y dimension.

Moreover, although the targets 902 are only shown positioned along a single axis 904, it should be noted that the targets may be positioned on multiple axis. For example, as shown in FIG. 9B, a first group of targets 902A may be disposed collinearly along a first axis 904A and a second group of targets 902B may be disposed collinearly along a second axis 904B. This implementation permits independent measurement of overlay in at least two directions. The first and second axis are typically transverse to one another and more particularly perpendicular one another. In the illustrated embodiment, the first axis 904A corresponds to the X-dimension while the second axis 904B corresponds the Y-dimension. Furthermore, each group consists of four targets 902. This implementation permits independent measurement of overlay in the X and Y directions.

Further still, although the targets have been described as having features (e.g., segmented lines) in substantially one direction, it should be noted that the targets may include features in more than one direction. In one implementation, for example, one or more of the collinearly positioned targets include features that permit scatterometric overlay measurement in first and second directions. By way of example, the features such as the segmented lines may be positioned in both the X and Y dimensions. In this case, the need for disposing targets along more than one axis as shown in FIG. 9B is reduced or eliminated. That is, if each target has features that permit two-dimensional scatterometric measurements, overlay may be determined along both the X- and Y-axes using a single set of targets disposed substantially-collinearly along a single axis. Alternatively, one or more targets may include one or more sub-targets. If the sub-targets have features that permit two-dimensional scatterometric measurements, the number of targets desirable for a particular degree of measurement accuracy may be reduced, and the targets may be disposed along a single line.

Additionally, targets disposed along one or more axis may be used for measurement of more then one parameter. For example, a first set of targets may be used for scatterometric measurement of wavelength along the X-axis and a second set of targets may be used for scatterometric measurement of spatial resolution along the Y-axis. In an alternative implementation, scatterometric measurement of spatial resolution may be performed along the X-axis while spectral measurements are performed along the Y-axis.

Combined CD and Overlay Marks

Scatterometry measurement targets consume a significant area of the wafer for both metrology of CD and overlay. This wafer area becomes very valuable as design rule shrinks. Currently, scatterometry overlay marks may consume >35×70 um space for each xy scatterometry overlay target group or mark on the wafer. These are used only for overlay measurements and therefore the manufacturers consider the loss of wafer space as undesirable. Therefore, it is desirable to reduce the total wafer area required for measurement targets or measurement features. Changes to optical system design to enable measurements on smaller targets may result in greater complexity of the optical system and potentially compromise measurement performance. In scatterometry overlay measurement as describe herein, the target area typically consists of four gratings for each axis (X and Y). Each of these gratings is typically larger than 15×15 um with a limited opportunity to shrink it further using conventional techniques. Each grating is composed of a first layer grating (e.g. STI) and a top layer grating (e.g. gate.). Each of the two layers has a programmed offset, which is typically smaller than the pitch of the top grating. In many cases the top layer is photoresist. An overlay measurement is achieved by analyzing the spectra of a reflected light from each of these gratings.

In scatterometry critical dimension (CD) measurement, the target area typically consists of a single grating, which may be positioned along either axis (X or Y). In some cases, the target area may include multiple gratings for each axis (X and Y). Each of these gratings is typically about 50×50 um. The measurement is typically performed on a single process layer target with no pattern underneath. This measurement is typically done on a photoresist pattern following a resist development step in a lithographic patterning process or following an etch or CMP process in other modules of the fabrication. A CD measurement is achieved by analyzing the spectra of a reflected light from the grating(s) as described in the above referenced U.S. Pat. No. 6,590,656 by Xu, et al.

In accordance with one embodiment of the present invention, the scatterometry CD marks and the scatterometry overlay marks are combined to enable the fab to save wafer space and to print larger scatterometry overlay marks with no impact on the wafer scribe-line. The combined mark is constructed with a scatterometry CD target as the first layer and scatterometry overlay target patterns as the top layer. This results in zero or minimal additional scribe line space allocated to scatterometry overlay.

FIG. 12 is a diagram of a combined mark 1200, in accordance with one embodiment of the present invention. The combined mark 1200 provides for both scatterometry CD measurement and scatterometry overlay measurement at different steps in the wafer manufacturing process. The combined mark 1200 is formed on at least two layers of the wafer, particularly a first layer L1 and a top layer L2. The first layer L1 includes scatterometry CD/profile targets 1202 and the top layer L2 includes scatterometry overlay targets 1204. Although shown as separate layers in the diagram, it should be noted that the scatterometry overlay targets 1204 are built on (over) the scatterometry CD profile targets 1202. The scatterometry CD/profile targets 1202 form an L1 scatterometry CD mark, which can be measured to determine CD after formation or processing of the L1 pattern. The scatterometry overlay targets 1204 cooperate with the scatterometry CD/profile targets 1202 to form an L2-L1 scatterometry overlay mark, which can be measured to determine overlay between the layers after formation of the L2 pattern (which comes after L1 pattern formation). As should be readily apparent, this method may be repeated to produce a layer 2 L2 scatterometry CD/profile target(s) followed by a Layer 3 L3 pattern to create an L3-L2 scatterometry overlay mark or target area.

The configuration of the scatterometry CD/profile targets 1202 and scatterometry overlay targets 1204 may be widely varied. In the illustrated embodiment, the scatterometry CD/profile targets 1202 disposed on L1 include a first grating 1206 oriented in a first direction and a second grating 1208 oriented in a second direction. The first direction may be orthogonal to the second direction. By way of example, the first grating 1206 may include vertical lines while the second grating 1208 may include horizontal lines. In addition, the scatterometry overlay targets 1204 disposed on L2 include a first group of gratings 1210 and a second group of gratings 1212. Both the first and second groups of gratings 1210, 1212 include one or more gratings 1214. The number of gratings 1214 may be widely varied. In one implementation, both the first and second groups 1210 and 1212 include four gratings 1214. The gratings 1214A in the first group 1210 are oriented in the first direction, and the gratings 1214B in the second group 1212 are oriented in the second direction. By way of example, the gratings 1214A in the first group 1210 may include vertical lines while the gratings 1214B in the second group 1212 may include horizontal lines.

In order to produce an L2-L1 overlay mark, the first group of gratings 1210 is positioned over the first grating 1206 of the CD/profile targets 1202, and the second group of gratings 1212 is positioned over the second grating 1208 of the CD/Profile targets 1202. This places gratings with similarly directed lines together, i.e., vertical lines with vertical lines and horizontal lines with horizontal lines. The first group of gratings 1210 cooperates with the first grating 1206 of the CD/profile targets 1202 and the second group of gratings 1212 cooperates with the second grating 1208 of the CD/profile targets 1202. The alignment between layers is determined by the shift produced between the corresponding lines of these cooperating structures. The vertical lines, for example, may be used to determine X overlay and the horizontal lines, for example, may be used to determine Y overlay.

Although the first and second gratings 1206 and 1208 of the CD mark are shown together, it should be noted that they may be placed apart. When implemented apart, the first group of gratings 1210 and the second group of gratings 1212 would also be placed apart, i.e., the first group of gratings 1210 goes with the first grating 1206 and the second group of gratings 1212 goes with the second grating 1208.

The advantages of combining overlay and CD marks are numerous. Different embodiments or implementations may have one or more of the following advantages. One advantage of combining marks is in the ability to reduce the need for additional wafer space for scatterometry overlay targets. Another advantage is that larger scatterometry overlay targets may be allowed if they do not require as much additional scribe line space. Larger scatterometry overlay targets may make optical design or optics manufacturing easier and may provide better scatterometry overlay metrology performance than on smaller scatterometry overlay targets.

Combination of Scatterometry Overlay and CDSEM

The purpose of this embodiment is to enable measurement of critical dimensions on a semiconductor wafer with an electronic microscope (CD-SEM) and measurement of overlay using scatterometry on the same measurement system or using linked measurement systems sharing at least part of a robotic wafer handling system. The established methods of measuring critical dimensions and overlay commonly require scheduling and operating separate measurement systems. One disadvantage of the established methods of measuring critical dimensions and overlay on separate measurement systems is the additional time required to schedule and run separate operations on separate metrology tools. Another disadvantage is the redundancy of common parts and the costs associated therewith.

In order to overcome these disadvantages, a metrology system that combines Scatterometry Overlay and CDSEM may be provided. In one embodiment, a scatterometry overlay measurement (SCOL) system is integrated with a CDSEM system such that the CDSEM and SCOL systems share at least part of the robotic wafer handling system and/or data systems. Alternatively, the CDSEM and the scatterometry overlay systems may be separate systems capable of independent operation, but linked in such a way that they share at least part of a robotic wafer handling system.

In operation, a wafer, a group of wafers, or batch of multiple wafers may be introduced to the combined metrology system by loading the wafer container onto the robotic wafer handling system dedicated to this combined metrology system. Measurement recipes may be selected specifying CDSEM measurements on some or all of the wafers and scatterometry overlay measurements on some or all of the wafers. The CDSEM measurements and the SCOL measurements may be specified together in one or more recipes, or may be specified in separate recipes. The CDSEM and SCOL measurements may be done on the same wafers or on different wafers or on some of the same wafer and some different wafers. The CDSEM and SCOL systems may operate in parallel, or in series.

One example of the combined metrology system would be integration of a scatterometry system capable of scatterometry overlay measurements (such as a spectroscopic ellipsometer, spectroscopic polarized reflectometer, or +/−1 order diffraction scatterometer) inside a CD-SEM such as any of those manufactured by KLA-Tencor of San Jose, Calif. Another example of a combined metrology system would be a linked system comprising a scatterometry overlay system, a CDSEM such as any of those manufactured by KLA-Tencor of San Jose, Calif., a robotic handler, and a wafer scheduling system. Communication to factory automation and/or factory information, and/or factory process control systems may be through separate communication or automation systems or may be at least partially or completed shared.

One advantage of the combined CDSEM and SCOL metrology system is the reduction in overall time required to complete scheduling and/or performing the CDSEM and scatterometry overlay measurements. At least one queue delay time may be eliminated. Performing CDSEM and overlay measurements in parallel can save at least part of the time required for separate measurement operations.

FIGS. 13A-13D show variations of a combined metrology tool 1300, in accordance with several embodiment of the present invention. In all the figures, the combined metrology tool 1300 includes a robotic wafer handling system 1302, a critical dimensioning scanning electron microscope (CD-SEM) 1304, a scatterometry overlay (SCOL) measurement instrument 1306, a wafer load position A 1308 and a wafer load position B and 1310, respectively. The robotic wafer handling system 1302 is configured to transfer wafers to and from the CD-SEM 1304 and SCOL measurement instrument 1306 as well as to and from the wafer load positions A and B 1308 and 1310. The critical dimensioning scanning microscope 1304 is configured to measure the critical dimensions that may include, for example, linewidth, top linewidth, via diameter, sidewall angle and profile. The scatterometry overlay measurement instrument 1306 is configured to measure the overlay as for example between two layers disposed on the wafer. The wafer load position A and wafer load position B are configured to hold one or more wafers. In most cases, they hold a plurality of wafers. The wafers may be from the same lot of from a different lot.

In FIGS. 13A and D, the CD-SEM 1304 and the SCOL measurement instrument 1306 are separate systems that are integrated via the robotic wafer handling system 1302. In FIG. 13B, the SCOL measurement instrument 1306 is integrated into the CDSEM 1304. In FIG. 13C, the SCOL measurement instrument 1306 is integrated into the robotic wafer handling system 1302.

In one operation, some of the wafers from wafer load position A and/or B have critical dimensions measured at the CD-SEM and thereafter have overlay measured at the scatterometry overlay measurement instrument. The wafer can be measured by both processes without being removed from the system, i.e., the wafer handling as well as the throughput issues associated therewith are reduced. In another operation, some wafers from wafer load position A and/or B have critical dimensions measured at the CDSEM and some other wafers from wafer load position A and/or B have overlay measured on SCOL measurement instrument. In any of these operations, the CDSEM and SCOL measurement instrument can proceed independently and simultaneously.

FIG. 14 is a flow diagram 1400 using a combined metrology tool, in accordance with one embodiment of the present invention. The method generally includes step 1402 where a group of wafers are received by the metrology tool. By way of example, the wafers may be a wafer lot that is loaded at position A in FIG. 13. Following step 1402, the process flow 1400 proceeds to step 1404 where the critical dimensions of a wafer from the group of wafers is measured. By way of example, the critical dimension measurements may be performed by a CDSEM as for example the CDSEM shown in FIG. 13. The process flow 1400 also proceeds to step 1406 where the overlay of a wafer from the group of wafers is performed by a SCOL measurement instrument as for example the instrument shown in FIG. 13. Steps 1404 and 1406 may be performed at the same time on different wafers. Steps 1404 and 1406 may be performed on the same wafer in a sequence of operations, as for example, from CD to overlay or from overlay to CD. The transferring of the wafer may for example be performed by the robotic system shown in FIG. 13. When all the measurements are performed, the process flow proceeds to step 1408 where the group of wafers are released from the metrology tool.

Combination of Scatterometry Overlay and Other Metrology or Inspection Methods

Scatterometry overlay may be combined with scatterometry profile or scatterometry critical dimension systems, or other semiconductor metrology or inspections systems. Scatterometry overlay may be integrated with a semiconductor process tool, for example a lithography resist process tool (also known as a resist track). Integration of metrology systems with process systems and combinations of metrology systems are described in (1) U.S. patent application, having patent Ser. No. 09/849,622, filed 04 May 2001, entitled “METHOD AND SYSTEMS FOR LITHOGRAPHY PROCESS CONTROL”, by Lakkapragada, Suresh, et al. and (2) U.S. patent, having U.S. Pat. No. 6,633,831, issued 14 Oct. 2003, entitled “METHODS AND SYSTEMS FOR DETERMINING CRITICAL DIMENSION AND A THIN FILM CHARACTERISTIC OF A SPECIMAN”, by Nikoonahad et al, which applications are incorporated herein by reference in their entirety.

Continuously Varying Offset Marks and Methods for Determining Overlay

The following is a description of a continuously varying offset mark and various techniques used to extract overlay information from this kind of mark, in accordance with another embodiment of the present invention. The continuously varying offset mark is a single mark that includes over laid periodic structures, which have offsets that vary as a function of position. By way of example, the periodic structures may correspond to gratings with different values of a grating characteristic such as pitch. Small differences in pitch produce offsets that vary from center to edge of the mark.

A method for utilizing the continuously varying offset mark includes determining the center of symmetry of the continuously varying offset mark and comparing it to the geometric center of the mark. If there is zero overlay, the center of symmetry tends to coincide with the geometric center of the mark. If overlay is non zero (e.g., misalignment between two layers), the center of symmetry is displaced from the geometric center of the mark. The displacement in conjunction with a preset gain factor associated with the mark is used to calculate the induced overlay error.

Embodiments of the continuously varying offset mark are discussed below with reference to FIGS. 15-24. However, those skilled in the art will readily appreciate that the detailed description given herein with respect to these figures is for explanatory purposes as the invention extends beyond these limited embodiments.

FIG. 15 is a flow diagram illustrating a method 1500 of determining overlay error, in accordance with one embodiment of the present invention. The overlay method 1500 begins at block 1502 where at least one overlay target is provided. The overlay target includes at least a first periodic structure and a second periodic structure. The periodic structures may be one-dimensional periodic structures (measurable in X or Y) or they may be two-dimensional periodic structures (measurable in both X and Y). The first periodic structure is placed in a first layer of a substrate and the second periodic structure is placed in a second layer of a substrate. In most cases, the periodic structures are positioned directly over one another in the various layers (over laid). In fact, the overlay target may be referred to as a periodic structure on periodic structure overlay target, and more particularly a grating on grating overlay target.

The periodic structures have many characteristics or attributes including, for example, line width, pitch, spacing, CD, rotation, etc. The periodic structures in the overlay target are designed to be substantially similar in most respects except for one particular attribute. With regards to the particular attribute, each of the periodic structures in the overlay target is configured to have a different value for the particular attribute. The particular attribute may, for example, correspond to any of those mentioned above. In one embodiment, the particular attribute corresponds to pitch. As used herein, pitch is the minimum displacement to which a periodic structure is invariant. In another embodiment, the particular attribute corresponds to rotation. These attributes have been found to work particularly well.

With regards to the values, the upper periodic structure may have a value that is larger or smaller than the lower periodic structure. In either case, it is generally preferable to keep the difference small. The difference may be less than 10%, more particularly, less than 5.0%, even more particularly, less than or equal to 1.0%. In one embodiment, the difference is between about 0.01% and about 10%. In another embodiment, the difference is between about 0.10% and about 5%. In yet another embodiment, the difference is between about 0.10% and 0.99%. In one particular embodiment of the present invention, the difference is equal to about 1%. These ranges and numbers have been found to work particularly well.

Because pitch is one of the most accurately controlled parameters of the microlithography process, it is believed that manufacturing continuously varying offset marks with pitch differences as small as 0.01% should be easily attainable. There already exists a known OPC method for controlling edge positions on the wafer to extreme accuracy. This technique, known as a resolution enhancement technique is based on lines constructed of partial duty cycle features along the edge of the feature, which are sub-resolution on the wafer. This technique may allow reticle manufacture with relatively high accuracy pitch control. A similar technique can be used for rotation to generate small angle differences, on the order of milliradians.

The small difference in the values of the selected attribute for the periodic structures is configured to generate a preset gain factor for the overlay target. The gain factor may for example be calculated by dividing one of the values by the difference in the values, and more particularly by dividing the value of the lower periodic structure by the difference in values. Alternatively, the gain factor may be calculated by averaging the first and second values and thereafter dividing the average by the difference in the first and second values. By way of example, an equation for the gain factor is shown below: Gain factor=V1/(V1−V2) where V1=the first value of the attribute; and V2=the second value of the attribute

To cite an example, suppose the attribute is pitch and the pitch for the first periodic structure is 400 nm and the pitch for the second periodic structure is 404 nm (a difference of 1.0%). The gain would be calculated as follows: Gain factor=400/(400−404)=100

The size of the difference and thus the size of the gain factor depends on various things such as dynamic range and sensitivity. Dynamic range is the range of induced overlay over which a measurement can be performed accurately. Sensitivity, on the other hand, is the minimum induced overlay that can be detected by the technique. There is typically a trade off between dynamic range and sensitivity. It is generally believed that if a large dynamic range is needed, the difference in the values of the periodic structures should be made larger (smaller gain factor) and if good sensitivity is needed, the difference in the values of the periodic structures should be made smaller (large gain factor).

Following block 1502, the method proceeds to block 1504 where a spatial array of optical data is obtained for the overlay target(s). By spatial array, it is generally meant that the optical data is a set of data points as a function of spatial coordinates on the target. That is, the optical data includes a plurality of data points that are each related to a physical spatial point along the target. In one example, the optical data includes a plurality of data points that vary spatially along an axis that is transverse to the measurable elements of the overlay target. In the case of a 1D grating, the axis generally lies perpendicular to the lines of the grating. In the case of a 2D periodic structure, multiple axes may be used, one for each direction of the periodic structure. In either case, the optical data points are located at different positions across the length of the overlay target.

The optical data may be collected in a single optical measurement (e.g., imaged), or it may be collected by spatially and sequentially performing a plurality of optical measurements along an axis (e.g., scanning or step and repeat). In imaging, for example, all the spatially dependent data points along the target are grabbed simultaneously. In scanning, for example, the data spatially dependent data points along the target are collected incrementally.

The optical data and the manner in which the optical data is obtained may be widely varied. By way of example, the optical data may include spectra, white light reflection intensity, angularly resolved scatterometry signal, spectroscopic ellipsometer signal, some other polarization dependent signal, and/or the like. Furthermore, the optical data may be obtained via imaging scatterometry, spectroscopic ellipsometry, angular scatterometry, and the like. One advantage of performing these techniques (spectroscopic characterization of an image) is that significant enhancement in contrast and hence information content per unit target area can be achieved.

In one embodiment, the optical data can be obtained by performing parallel acquisition and processing scatterometry by either an imaging or scanning mechanism. These techniques have the advantage of enhanced measurement speed and enable many options to reduce target size. In one implementation, the target may be scanned at constant or any other velocity profile, instead of independent target acquisition. This may be achieved by moving either the optics or the wafer.

In another embodiment, the optical data can be obtained by performing imaging spectroscopic scatterometry (reflectometry) using a two dimensional CCD, and assigning one axis for spectral information and the other for a single spatial dimension. For example, by using cylindrical lens illumination and either a grating or prism spectrometer to create a dispersed line image of the illuminated region on a CCD.

In another embodiment, the optical data can be obtained by performing imaging spectroscopic scatterometry (reflectometry) using a two dimensional CCD, where both axes are assigned spatial dimensions, e.g., the spectral dimension is assigned to the temporal domain. Specifically, as in the case of Fourier transform interferometery where an image is split into two wavefronts at a beam splitter and a time domain continuously or discretely varying path difference is introduced between them. The Fourier transform of the time dependent signal at each pixel then becomes the spectrum at that point in the image. In this case, the difference analysis can be carried in the Fourier or optical path difference (time) domain with out transforming the information into the spectral domain. This may for example be accomplished using either Michelson or Sagnac interfometers.

Although several embodiments for obtaining the spatial array of optical data have been discussed, it should be noted that these are not a limitation and that any of techniques mentioned herein may be used to acquire the spatial array of optical data.

Following block 1504, the method proceeds to block 1506 where the center of symmetry of the overlay target is determined based on the spatial dependence of the optical data (the optical data is returned or converted to the spatial domain). The center of symmetry is the point about which the observed structure is invariant to reflection, i.e., the point where the optical data is substantially equal (identical) on both sides of the point. When overlay is zero, the center of symmetry tends to coincide with the geometric center of the target. When overlay is non zero, the center of symmetry lies somewhere other than the geometric center of the target.

Many algorithms can be used to determine the center of symmetry from the spatial array of optical data.

In one embodiment, self correlation is used to determine the center of symmetry. In self correlation, an array of data is replicated in reverse order and shifted with respect to the source data. The correlation function between the two data arrays is calculated using any of a number of known correlation metrics for each shift position. The correlation is plotted as a function of shift and the maximum of the correlation function is the position of maximum self correlation, i.e. the center of symmetry.

In another embodiment, the center of symmetry is determined by a null point method. In this method, an array of spectra is obtained. The array of spectra is a series of spectra from different points along the grating. This may be accomplished for example by measuring the overlay target using scatterometry. Thereafter, a determination is made as to whether or not overlay error is present. This is generally accomplished by analyzing the difference between spectra measured at equidistant points from the grating edges. For zero overlay, the difference spectra should be zero over the whole spectral range. If an overlay error is present, the difference spectra from equidistant points should be non-zero. If non-zero, the null point for the array of spectra is determined. As should be appreciated, if the overlay is zero, then points equidistant from the center tend to be the same, and if not equal to zero, then there will be some point (different centerline) where things will be equidistant from that point. This point is the null point.

The null point can be found by systematically shifting the center point or folding point of the array of spectra and analyzing the difference between the spectra measured at equidistant points from the actual and shifted center points of the array of spectra. The null point is the center point that produces a difference spectra that is substantially zero over the entire spectral range (spectra is substantially identical on both sides of the point and therefore the difference spectra is substantially zero). The null point is associated with a particular spatial location, i.e., the center of symmetry.

An alternate method to determine the null point is to plot the amplitude of the difference spectra over all wavelengths, or a selected spectral band, as a function of position in the periodic structure. This tends to produce a smooth varying function that passes through zero. The zero point is the null point described above.

Following block 1506, the method proceeds to block 1508 where a distance X between the center of symmetry and the geometric center of the overlay target is calculated. The geometric center of the overlay target is the physical center of the overlay target by design. For example, if the overlay target is 20 microns wide, then the geometric center of the overlay target is at 10 microns.

To cite an example, suppose the geometric center of the overlay target is at 10 microns and the center of symmetry is found to be 11 microns. The distance X would be calculated as follows: X=11−10=1 micron

Following block 1508, the method proceeds to block 1510 where the overlay error is determined by dividing the distance X by the gain factor of the overlay target. The equation for calculating overlay error may correspond to the following:

${OVL}_{error} = {\frac{X}{{Gain}\mspace{14mu}{factor}} = \frac{X*V\; 1}{\left( {{V\; 1} - {V\; 2}} \right)}}$ where

-   OVL_(error)=the amount of misalignment between two layers -   X=the distance between the center of symmetry and the geometric     center of the target -   V1=the value of the first attribute -   V2=the value of the second attribute

Referring back to the examples given above, the overlay error would be calculated as follows:

${OVL}_{error} = {\frac{1\mspace{14mu}{micron}}{100} = {\frac{1\mspace{14mu}{micron}*\left( {400\mspace{14mu}{nm}} \right)}{\left( {{400\mspace{14mu}{nm}} - {404\mspace{14mu}{nm}}} \right)} = {10\mspace{14mu}{nm}}}}$

The present invention, in each of the various embodiments, uses overlay targets that are composed of periodic structures formed on two layers of a semiconductor wafer to provide overlay information between those two layers of the semiconductor device. The periodic structures may be formed in any of the layers on a wafer during the semiconductor fabrication process. For example, the periodic structures may be formed in a photoresist layer, a dielectric layer or a metal layer.

The dimensions and position of the periodic structures may be widely varied. In some cases, the feature size and pitch of the periodic structures are substantially equal to those of the critical devices features of the patterning step performed on the layers under test. That is, the dimensions of the features are comparable to the features of the circuit patterns. Furthermore, the periodic structures may be positioned in the circuit pattern itself, or alternatively around the edge of the circuit pattern as is conventionally done.

In accordance with the present invention, the overlay targets are configured such that the center of symmetry of the overlay target corresponds to the geometric center of the overlay target when the two layers are properly aligned, and conversely that the center of symmetry of the overlay target is displaced from the geometric center of the overlay target when the two layers are not properly aligned. Several examples of overlay targets are shown and described below.

FIG. 16 is a diagram of a one dimensional overlay target 1600, in accordance with one embodiment of the present invention. As shown, the target 1600 includes a first grating 1602 located on a first layer and a second grating 1604 located on a second layer. Furthermore, each of the gratings 1602 or 1604 includes a plurality of spaced apart parallel lines 1606. L indicates the length of grating 1604. δ indicates a value that may vary in different embodiments.

Although the two gratings are shown next to each other, it should be appreciated that this is done to better illustrate the overlay target. In most cases, the first grating is positioned directly over the second grating thereby forming a grating on grating structure as shown in FIGS. 17-20. That is, the gratings are laid over one another such that in the case of zero overlay their axes of symmetry coincide. Furthermore, although the lines are shown as horizontal lines that are vertically oriented side by side in the Y direction, it should be appreciated that the lines may also be vertical lines that are horizontally positioned side by side in the X direction. The illustrated configuration is used to determine overlay error in the Y direction while the alternate configuration (rotated 90 degrees) is used to determine overlay in the X direction. In most cases, both configurations are positioned on a substrate so that overlay error can be determined in multiple directions.

In order to produce a gain factor, in accordance with the present invention, the first grating 1602 is configured with a first value of a grating characteristic and the second grating 1604 is configured with a second value for the same grating characteristic. The grating characteristic may for example correspond to pitch or rotation. The value of the first grating 1602 may be made larger or smaller than the value of the second grating 1604 (or vice versa). The difference between the first and second values is preferably kept small. The small difference will induce an effective offset at either end of the grating, which diminishes linear as the center of the grating is approached.

The difference may be less than 10%, more particularly less than 5% and even more particularly less than or equal to 1.0%. In one embodiment, the difference is between about 0.01% and about 10%. In another embodiment, the difference is between about 0.10% and about 5%. In yet another embodiment, the difference is between about 0.10% and 0.99%. In one particular embodiment of the present invention, the difference is equal to about 1%.

Referring to FIGS. 17-20, several embodiments of the overlay target 1600 will be described. FIGS. 17A and 17B are exemplary top view diagrams and FIGS. 18A and 18B are exemplary side view diagrams of the overlay target when the first grating 1602 is positioned dver the second grating 1604. FIGS. 17A and 18A show the two grating structures when they are in perfect alignment, and FIGS. 17B and 18B show the two grating structures when they are out of alignment, i.e., when there is an overlay error between the two layers. In this particular illustrated embodiment, the grating characteristic corresponds to pitch. The pitch of the first grating 1602 is made larger than the pitch of the second grating 1604. The pitch of the first grating 1602 may for example be 1.0% larger than the pitch of the second grating 1604. As result of this difference, the offset between the lower and upper gratings 1602 and 1604 varies continuously as a function of position.

FIGS. 19A and 19B are exemplary top view diagrams and FIGS. 20A and 20B are exemplary side view diagrams of the overlay target when the first grating is positioned over the second grating. FIGS. 19A and 20A show the two grating structures when they are in perfect alignment, and FIGS. 19B and 20B show the two grating structures when they are out of alignment, i.e., when there is an overlay error. In this particular illustrated embodiment, the grating characteristic corresponds to pitch. Unlike the embodiment shown above, the pitch of the first grating 1602 is made smaller than the pitch of the second grating 1604. The pitch of the first grating may for example be 1.0% smaller than the pitch of the second grating. As result of this difference, the offset between the lower and upper gratings 1602 and 1604 varies continuously as a function of position.

FIG. 21 is a diagram of a one dimensional overlay target 1620, in accordance with another embodiment of the present invention. The overlay target 1620 is similar to the overlay target 1600 described above, except that the first layer includes a pair of gratings 1602A and 1602B rather than a single grating. During processing, the pair of gratings 1602A and 1602B are disposed directly over the second grating 1604. The pair of gratings 1602A and 1602B are spatially separated and positioned side by side. Furthermore, the pair of gratings 1602A and 1602B are similar in most respects except that they have different values for the grating characteristic. One of the gratings 1602A is configured with a value larger than the second grating 1604 while the other grating 1602B is configured with a smaller value than the second grating 1604. By way of example, the left side grating 1602A may have a pitch 1.0% larger than the pitch of the second grating 1604, and the right side grating 1602B may have a pitch 1.0% smaller than the pitch of the second grating 1604 (or vice versa).

One advantage of the twin grating approach is that it alleviates the need to determine the geometric center of the overlay target. In the twin grating approach, the upper gratings move in opposite directions as a function of the overlay error. Therefore, in order to determine overlay error, the center of symmetry for each of the gratings is found and thereafter the distance Y between the centers of symmetry is calculated (the relative offset between the left and right gratings). The distance Y is divided by two and the result is divided by the gain factor to determine the overlay error. When using the twin grating approach, the equation for calculating overlay error may correspond to the following:

${OVL}_{error} = {\frac{Y}{2*{Gain}\mspace{14mu}{factor}} = \frac{Y*V\; 1}{2*\left( {{V\; 1} - {V\; 2}} \right)}}$ where

-   OVL_(error)=the amount of misalignment between two layers -   Y=the distance between the center of symmetry for the pair of     targets produced by the left and right gratings -   V1=the value of the first attribute -   V2=the value of the second attribute

A further advantage of the twin grating approach is that it reduces the distance between the spectral sampling points reducing the impact of spatially dependent process variation on the difference spectra.

In one embodiment of the twin grating approach, the two gratings may be scanned by a two beam spectrometer or spectrally imaged by a two dimensional spectral imager during measurement of the twin grating target. In the scanning case, common mode rejection can be enabled since the difference spectra may be obtained between adjacent points, which are acquired simultaneously. In this case, the overlay calculation can proceed as in the zero crossing points (null point) method described above, however the difference spectra are acquired from side by side points.

In another embodiment of the twin grating approach, the adjacent targets are illuminated with a single wavelength uniform wavefront; an optical phase reversal in one of the reflected wavefronts by path difference insertion techniques is induced; and the two wavefronts are superimposed on a single detector or array of detectors. In the case of zero overlay, when recombined this should provide a null signal. Alternatively, if adjacent targets are illuminated with arbitrarily phase shifted wavefronts, when recombined, they should provide a specifically structured signal in the case of zero overlay.

FIGS. 22A and 22B are diagrams of a two-dimensional overlay target 1640, in accordance with one embodiment of the present invention. Unlike the one dimensional overlay targets shown above, the two dimensional overlay target 1640 is configured to generate overlay information in multiple directions as for example in X and Y. The two-dimensional overlay target 1640 includes a first periodic structure 1642 located on a first layer and a second periodic structure 1644 located on a second layer. FIG. 22A shows the two periodic structures when they are in perfect alignment, and FIG. 22B shows the two periodic structures when they are out of alignment, i.e., when there is an overlay error.

Each of the periodic structures 1642 and 1644 is periodic in both the X and the Y directions. As shown, the periodic structures 1642 and 1644 are broken up into an array of sub elements 1646 that are spaced apart and positioned in rows and columns. Like the 1-D targets mentioned above, the periodic structures 1642 and 1644 are similar in most respects except for a particular attribute of the periodic structures. The attribute may for example correspond to pitch or rotation. In either case, the first periodic structure 1642 is configured with a first value for the particular attribute and the second periodic structure 1644 is configured with a second value for the particular attribute. The value of the first periodic structure 1642 may be larger or smaller than the value of the second periodic structure 1644. In either case, the difference is typically quite small. In the illustrated embodiment, the first periodic structure 1642 has a pitch that is smaller than the pitch of the second periodic structure 1644 in both directions. By way of example, the difference between the pitches may be about 1.0%.

FIG. 23 is a top view diagram of an overlay region 1660, in accordance with another embodiment of the present invention. The overlay region 1660 includes a plurality of continuously varying offset marks 1662, and more particularly a first set of continuously varying offset marks 1662A oriented in the X direction and a second set of continuously varying offset marks 1662B oriented in the Y direction. Each set includes a pair of continuously varying offset marks, which are positioned laterally relative to each other. The laterally positioned marks may be positioned on the same axis or they may be shifted relative to one another (as shown). Furthermore, the continuously varying offset marks 1662 in each set are configured differently. One of the marks is designed with an upper periodic structure having a smaller value of a particular attribute, and the other mark is designed with an upper periodic structure having a larger value of a particular attribute. In the illustrated embodiment, the periodic structures are gratings and the particular attribute corresponds to pitch.

In addition to having multiple continuously varying offset marks 1662, the overlay region 1660 may additionally include a conventional box in box overlay target 1664. The box in box target 1664 is disposed in the center, and the continuously varying offset marks 1662 are disposed at the periphery of the overlay region 1600. The continuously varying offset marks 1662 are configured to surround the box in box target 1664. In fact, the box in box target 1664 may be disposed between each of the continuously varying offset marks 1662A and 1662B in the set of continuously varying offset marks.

FIGS. 24A and 24B are top view diagrams of an overlay region 1680, in accordance with another embodiment of the present invention. The overlay region includes a plurality of continuously varying offset targets 1682 and 1684. Each of the targets is formed as a grating on grating target. The gratings 1686A and 1686B of the first target 1682 are oriented in a first direction such as X and the gratings 1688A and 1688B of the second target 1684 are oriented in a second direction such as Y. As shown, the gratings 1686 of the first target 1682 are designed with different pitches. Furthermore, the gratings 1688 of the second target 1684 have been rotated relative to one another (e.g., tilted y gratings). By way of example, the bottom grating may be tilted in the clockwise direction, and the upper grating may be tilted in the counterclockwise direction (or vice versa). Alternatively, only one of the gratings may be tilted. It should be noted that the angle of rotation is exaggerated in this Figure and that the rotation is typically closer to 1% of a degree. Moreover, FIG. 24A shows the gratings when they are in perfect alignment, and FIG. 24B shows the gratings when they are out of alignment, i.e., when there is an overlay error.

As should be appreciated, these targets couple X and Y overlay. The center of symmetry of the rotated target is shifted by ±G·OVL(Y)+OVL(X), where G is the gain factor. The relative sign between the two terms is determined by the orientation of the tilt. Thus, the measurement of the center of symmetry of rotated target complemented by a separate measurement of OVL(X) or a measurement of another linear combination of OVL(X) and OVL(Y) (using another CVO target) completely determines both overlays. One advantage of using this set up is that the measurements only need to be performed in a single direction such as X, yet both X and Y overlay can be extracted.

Referring back to FIGS. 15 and 16, the dynamic range and accuracy of the methods described therein can be calculated using the following methodology. First, the position determination accuracy within the grating is assumed to be AΔ. Second, assuming the pitch is greater on the second layer, the target length can be defined as L on the first layer and (1+δ)L on the second layer. In this case, the dynamic range is less than or equal to +/−δL/2 since this is the offset between layers at the extreme edge of the gratings. The accuracy of overlay measurement, on the other hand, can be identified by Δx* gain factor (or L/δL). It should be pointed out that the above mentioned analysis is equally valid for the rotated target as for the pitch change case, however δ would represent the angle in radians.

Using the numbers from the previous example, if the periodic structures have pitches of 400 and 404 nm and the target is 20 microns wide, the dynamic range of the measurement will be +/−100 nm. If the positioning accuracy of the scanning system Δx is +/−50 nm, the accuracy of overlay measurement will be +/−0.5 nm. In the case of an imaging system, in which the zero crossing can be calculated to sub-pixel accuracy, the overlay measurement accuracy should be considerably better.

In an alternate embodiment of the invention, the over laid periodic structures of the continuously varying offset mark can be offset or shifted from one another rather than being aligned along an axis of symmetry as shown in FIG. 2. In cases such as this, the amount of offset is taken into account when determining the overlay error. For example, block 108 in FIG. 1 may further include adjusting the distance X based on the relative offset before calculating the overlay error in block 110. By way of example, the relative offset divided by two may be subtracted or added to the distance X. Using the example of above, and supposing the designed relative offset between periodic structures is 1 micron, then the distance X would be calculated as 11-10-(½) or 0.5 microns.

In another embodiment of the present invention, the method described in FIG. 15 includes additional blocks for determining the correct overlay budget of a circuit pattern. The additional blocks are performed after determining the overlay error on the specific target under test. The overlay budget is the amount of overlay error that is acceptable for the circuit to function properly, i.e., the misalignment between patterns may lead to device failure if the overlay budget is exceeded. In most cases, the overlay budget includes a built in safety margin. The safety margin is used to overcome the fact that the designers do not know how the device will behave in a gray area of overlay error, i.e., the area between full device functionality and a device that does not function properly. While the safety factor works, it does require the designer to add a buffer to the overlay budget and therefore some functional devices may be scrapped. Alternatively, it may require the designer to enlarge the size of the device and hence reduce the number of devices per die. The additional blocks are configured to help monitor the affects of overlay on the features of the device. By knowing their affect on the features, a more accurate overlay budget may be determined, i.e., without a safety margin or with limited safety margin.

The additional blocks include a first block where structural characteristics or geometric parameters are extracted from the optical data as a continuous function of overlay error. The structural characteristics may include for example side wall angle, edge roughness, other critical dimensions, and/or the like. The structural characteristics may be obtained using the same optical data used to determine the overlay error. Once the structural characteristics are obtained, they can be analyzed along with the overlay error to generate the structural characteristics as a function of overlay error. Once the function is determined, the method proceeds to a second block where the overlay budget or lot disposition parameter is determined based on the structural characteristics as a function of overlay error. The designer can determine what structural characteristics are acceptable and thereafter select the correct overlay budget without using a safety margin. In one example, lot dispositioning can be performed using structural metrics in conjunction with overlay data.

Another example for a block where structural characteristics are determined. The center of symmetry approach relies on quantities, which are symmetric; i.e., they are unchanged when the sign of the overlay is reversed (keeping the magnitude of the overlay fixed). There are some ellipsometry or reflectometry signals, however, which are anti-symmetric in nature. Such a quantity changes sign when the overlay changes sign. An asymmetry in any of the lines forming any of the periodic grating in the target would lead to a breakdown of this anti-symmetry. This block measures one or several anti-symmetric properties, and determines the degree of structural asymmetry in the target as a function of overlay.

The continuously varying offset mark may have one or more of the following advantages. One advantage of the continuously varying offset mark is that an entire period is not needed in order to determine overlay (only a portion is needed). In the example given above, for example, only half of the period is needed. This is not a limitation, however, as it can be envisioned that less than half of a period can be used. In contrast, the Moiré technique needs to see the entire period. Another advantage of the present invention is that the gain is very high (since the difference is very small). This allows a significant reduction in the size of the target. Furthermore, by introducing a gain factor between the measured position of the center of symmetry and the induced overlay, the sensitivity to all the sources of inaccuracy and bias in an imaging tool such as aberrations and illumination non-uniformity are reduced by the same gain factor. Another advantage of the continuously varying offset mark is that the techniques relies on finding the center of symmetry, and cares nothing for the beat pitch (which is a requirement in the Moire technique). The beat pitch is the observed pitch when two periodic structures of differing pitches are placed on top of each other (combined). Another advantage of the continuously varying offset mark is that only a single target is needed and therefore space can be conserved.

While the invention has been described in terms of several preferred embodiments, there are alterations, permutations, and equivalents, which fall within the scope of this invention. For example, although the continuously varying offset mark is directed primarily at over laid periodic structures, in some cases, it may be possible to determine overlay with periodic structures positioned at different locations (e.g., side by side rather than on top of each other). In cases such as this, a shearing imaging system that takes two images and combines them into one may possibly be used to extract the overlay information. It should also be noted that there are many alternative ways of implementing the methods and apparatuses of the invention. For example, although X and Y overlay targets are typically constructed similarly, in some cases, it may be desirable to construct them differently. For example, the X overlay target may be constructed with pitch differences while the Y overlay target may be constructed with rotational differences. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations, and equivalents as fall within the true spirit and scope of the present invention. 

1. An overlay target comprising: a first periodic structure located on a first layer, the first periodic structure having a first value of an attribute of a periodic structure; and a second periodic structure located on a second layer, and positioned over the first periodic structure, the second periodic structure having at least a second value of the attribute of a periodic structure, wherein the first and second values have a substantially small difference and wherein each of the first and second periodic structures is for measuring overlay in two directions and wherein the attribute is rotation or pitch, wherein the length of the first and second periodic structures is independent from the substantially small difference.
 2. The overlay target as recited in claim 1 wherein the second value is larger than the first value.
 3. The overlay target as recited in claim 1 wherein the second value is smaller than the first value.
 4. The overlay target as recited in claim 1 wherein the first and second periodic structures each consist of a plurality of sub segments oriented in two directions.
 5. The overlay target as recited in claim 1 wherein the difference between the first and second values is less than 10%.
 6. The overlay target as recited in claim 1 wherein the difference between the first and second values is less than 5%.
 7. The overlay target as recited in claim 1 wherein the difference between the first and second values is less than or equal to 1.0%.
 8. The overlay target as recited in claim 1, wherein the small difference is configured to generate a preset gain for the overlay target.
 9. The overlay target as recited in claim 1 wherein the difference between the first and second values is between about 0.01% and about 10%.
 10. The overlay target as recited in claim 9 wherein the difference between the first and second values is between about 0.10% and about 0.99%.
 11. An overlay target comprising: a first portion located on a first layer, the first portion including a first grating having a first value of a pitch; a second portion located on a second layer and centered above the first portion, the first and second portions having substantially a same dimensional profile, the second portion including a second grating and third grating that are positioned side by side, the second grating having a second value of a pitch, and the third grating having a third value of a pitch, the second value being slightly larger than the first value, the third value being slightly smaller than the first value, wherein the second grating and the third grating extend substantially over the first grating.
 12. The overlay target as recited in claim 11 wherein a difference between the first value and the second and third values is 1.0%. 